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Clor.tridium difficile focussed antibodies 

The present invention is concerned with antibodies either specific for and which confer 
immunity against infection by Clostridium difficile or antibodies produced after challenge 
5 with a C. difficile based vaccine, methods of identifying same, and uses of same. 

C. difficile is a gram-positive anaerobic bacterium, and is deemed a significant human 
pathogen causing a spectrum of diseases ranging from mild diarrhoea to fulminant 
pseudomembranous colitis (PMC) - collectively referred to as C. difficile antibiotic- 
10 associated diarrhoea (CDAD). CDAD is a common, iatrogenic, nosocomial disease 
associated with substantial morbidity and mortality, especially in the elderly. Two factors 
have been assigned main roles in the pathogenesis of CDAD - the suppression of the 
resident intestinal flora by the administration of antibiotics, and the production by the 
bacterium of two high molecular weight toxins, exotoxin A and exotoxin B. 

15 

The bacterium is endemic in hospitals, and studies have shown that approximately one 
third of patients receiving antibiotic treatment in acute-care medical wards were colonised 
by C. difficile while in hospital (Kyne, L., et ah, 2002, Clin. Infect. Dis. 34(3), pp346-53, 
PMID: 11774082). Of these patients, over half went on to develop CDAD while the 

20 remainder were symptomless carriers. CDAD is a major factor in extension of patient 
hospital stay times, and estimates suggest that the cost of this disease in the US exceeds 
$1.1 billion per year (Kyne, L., et ah, Supra). Patients suffering from CDAD respond well 
to a treatment which includes a discontinuation of the inciting antibiotic and treatment with 
either of the antibiotics metronidazole and vancomycin. However, the use of e.g. 

25 vancomycin is one of last resort since it is associated with several problems. Not only may 
it cause nephrotoxicity, ototoxicity, bone marrow toxicity and the red man syndrome, but 
the problem with this treatment regime is that the CDAD often returns after successful 
treatment of the initial episode, and this reoccurrence represents a serious clinical problem. 
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Additionally, there is evidence that C. difficile is becoming resistant to metronidazole and 
partially resistant to vancomycin, demonstrating the need for new alternatives in the 
treatment of CD AD. 

5 Exotoxins A and B which are produced by pathogenic strains of the bacterium are 
cytotoxic, enterotoxic and proinflammatory, and are considered to be the main virulence 
factors of this non-invasive microorganism. However, not all infections with toxigenic 
strains result in disease, prompting the search for additional virulence factors. Bacterial 
surface expressed antigens represent candidate virulence factors, and are also considered 

10 important since such proteins likely mediate the essential functions such as adhesion to the 
epithelial layer of the gut in the first step of colonization or interaction with mediators of 
local immunity. In common with many other bacteria, C. difficile expresses a crystalline 
or paracrystalline surface layer (S-layer) on the outer cell surface. Such S-layers comprise 
proteins or glycoproteins forming a regularly arranged lattice on the external surface of the 

15 bacterium, and have previously been shown to be essential for the virulence of pathogens 
such as Aeromanas salmonicida and Campylobacter fetus. In contrast to most bacteria 
which comprise one S-layer, C. difficile is known to comprise two superimposed 
paracrystalline S-layers, each composed of a glycoprotein subunit which varies slightly in 
apparent molecular weight among different C. difficile strains. Most strains of C. difficile 

20 express two major S-layer proteins (SLPs), one of 32-38 kDa (low-MW SLP) and a second 
of 42-48 kDa (high-MW SLP). The low-MW SLP appears to be immunodominant and is 
the antigen most commonly recognised by patients suffering from CD AD, and is the only 
antigen recognised in EDTA extracts of bacteria by antisera raised in rabbits against whole 
C. difficile cells (Calabi, E. et al., 2001, Mol. Microbiol., 40(5) pi 187-99, PMID: 

25 11401722). 

During the course of microbial infection various adaptive strategies are employed by the 
immune system. One such strategy, and arguably the most important, is the production of 
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an antibody response. Antibodies capable of binding antigens displayed by the infectious 
agent are produced and bind to and allow killing of the microorganism through 
complement activation, recruitment of macrophage and through direct interaction with the 
microbe itself. The therapeutic efficacy of antibodies capable of binding a given antigen 
5 varies and this is reflected in the fact that antibody production by the immune system 
matures during the course of an infection and becomes focussed in the case of a patient 
successfully fighting off an infection. 

The antibody response is elicited by the B cell repertoire where individual B cells each 
10 produce structurally diverse antibody molecules. The actual size of this B cell/antibody 
repertoire is unknown, but it is estimated that the random clonal frequency of reactivity for 
a given antigen may be as high as 1 in 100,000 in cultured B cells (Nobrega, A., et al., Eur 
J Immunol. 1998 Apr;28(4): 1204-15; PMID: 9565360). During the course of infection, 
antibodies capable of binding the pathogen are selected for by changes in the B cell 
15 population resulting in key antibodies being produced in large numbers. The mechanisms 
for these changes include clonal expansion, isotype switching, and somatic mutation of 
immunoglobulin variable regions. B cells responsible for generating antibodies which are 
able to bind a pathogen multiply, thus skewing the B cell repertoire and changing the 
proportions of B cells. 

20 

Non-antibiotic based therapeutic regimes for the treatment/prevention of C. difficile 
infection are based upon vaccination and passive immunization. Vaccination treatment 
comprises administering to a patient either a nucleic acid sequence encoding an 
immunogenic fragment of the C. difficile surface layer protein or a variant or homologue 
25 thereof, or an equivalent polypeptide fragment (as disclosed in WO 02/062379). Passive 
immunotherapy is typically achieved by administering to a patient a monoclonal antibody 
specific to an immunogen produced by a pathogen. In general, passive immunotherapy is 
particularly effective in treating immunocompromised patients who are unable to respond 
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to vaccination, and to patients who need immediate therapy and cannot wait for 
vaccination to take effect. In the case of a C. difficile infection, passive immunization 
relies on the administration to a patient of toxin-neutralizing polyclonal immune globulin, 
(as disclosed in WO 99/2030.4), or antibodies raised against the whole bacterium and the 
5 toxins (as disclosed in WO 96/07430). 

Thus, as can be seen from the prior art, in order to effect treatment of the patient, an 
immunogen is first isolated and purified, administered to test animals, and cells producing 
antibody specific against the immunogen cloned. The range of antibodies produced by the 
10 clones can then be tested for their therapeutic efficacy and a single monoclonal antibody 
selected which is then administered to patients to effect passive immunotherapy. 

There are, however, several problems associated with current passive immunotherapy 
regimes aimed at treating C. difficile infections. For example, passive immunotherapy 

15 requires that there are survivors of the C. difficile infection and patients who have been 
vaccinated. Each batch of antibody can be different leading to difficulties associated with 
standardisation and administration of the imrnunotherapeutic reagent. In addition, the 
problem of inadvertent administration to a patient of adventitious agents (e.g HIV, HBV, 
HCV, or as yet unidentified agents) is a real one, and up-to-date screening of any 

20 imrnunotherapeutic reagent is required. Finally, the strain variability exhibited by C. 
difficile means that a given antibody may only be useful against certain strains of the 
bacterium and not against others. 

Obviously the techniques involved are somewhat complex, inconvenient, expensive and 
25 time consuming. In general they require that an immunogen is isolated from the infecting 
pathogen and used to generate antibody. Simply isolating the immunogen can be extremely 
difficult and time consuming, particularly if it comprises carbohydrate or complex non- 
linear epitopes (i.e. epitopes having secondary, tertiary and/or quaternary structural 
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features) which cannot be synthesised in vitro, and are impossible to isolate and produce 
as an antigen for use as e.g. a vaccine. The SLPs of C. difficile contain a glycoprotein 
subunit which varies in molecular weight between species. It may be the case that C. 
difficile epitopes are only produced in vivo and are not synthesised in vitro, consistent with 
5 for example Neisseria gonorrhoeae infections (the causal agent of gonorrhea) where 
specific antigens are only expressed upon infection of a host. Such antigens fall into the 
general class of cryptoantigens. Furthermore, C. difficile may display highly labile antigens 
which are difficult to work with since during use they simply degrade and the epitope they 
display is lost. With regard to this range of epitopes/antigens whose identification and/or 
10 in vitro use is of great difficulty or impossible, the present invention overcomes these 
disadvantages and provides a solution by providing antibodies whose CDR regions have 
been generated in response to C. difficile epitopes during antibody responses of patients 
infected with C. difficile. 

15 In addition, the prior art typically has to attempt to achieve an equivalent of affinity 
maturation of antibodies by first synthesising a set of candidate antibodies specific to an 
antigen, testing them for their binding characteristics and then modifying the sequences 
of the candidates in order to optimise the binding. A thorough attempt at affinity 
maturation (i.e. optimising antibody binding) can require the synthesis of thousands of 

20 different antibodies, which can be costly and time-consuming. Because the antibodies of 
the present invention are obtained from patients who have either been infected by a 
pathogen displaying the antigen or who have been vaccinated with an antigen, they have 
by their very nature and definition already undergone affinity maturation, as is most clearly 
demonstrated by the sequences of their CDR3 regions of the variable heavy and variable 

25 light chains (i.e. the CDR-H3 and CDR-L3 regions). 
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According to the present invention there is provided an antibody or an antigen binding 
fragment thereof having the CDR-H3 sequence selected from the group consisting of: SEQ 
ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, SEQ ID NO: 30, and SEQ ID NO: 31. 

5 Also provided according to the present invention is an antibody or an antigen binding 
fragment thereof having the CDR-L3 sequence selected from the group consisting of: SEQ 
ID NO: 32, SEQ ID NO: 33, and SEQ ID NO: 34. 

Also provided according to the present invention is an antibody or an antigen binding 
10 fragment thereof having a CDR-H3 sequence selected from the group consisting of: SEQ 
ID NO: 27, SEQ ID NO: 28, SEQ ED NO: 29, SEQ ED NO: 30, and SEQ ID NO: 3 1 , and 
a CDR-L3 sequence selected from the group consisting of: SEQ ID NO: 32, SEQ ID NO: 
33, and SEQ ID NO: 34. 

15 According to the present invention there is also provided a method for identifying 
candidate sequences of at least the CDR3 region of antibodies specific against at least one 
antigen produced by C. difficile during an infection or against a vaccine, comprising the 
steps of: 



20 



(i) 



with B cells isolated from at least one patient who has been infected 
by Clostridium difficile or administered said vaccine, sequencing at 
least the CDR3 region of the VH and/or VL coding regions of said B 
cells; and 



25 



(") 



correlating said sequenced at least the CDR3 regions of the VH and/or 
VL coding regions of said B cells from said at least one patient to 
identify a set of candidate sequences for at least a CDR3 region of 
antibodies specific against said at least one antigen produced by 
Clostridium difficile or against said vaccine, each of said set of 
candidate CDR3 sequences or a sequence having at least 80% 
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homology therewith occurring in total at a frequency of at least 1 
percent in the set of sequences determined at step (i). 

Examples of patients used as a source of B cells in such a method are humans and other 
mammals such as mice, rabbits, rats, baboons, monkeys and apes. 

In certain embodiments of the invention, step (i) may comprise the steps of: 

(i)(a) isolating B cells from at least one patient who has been infected by 

Clostridium difficile or administered said vaccine; and 
(i)(b) sequencing at least the CDR3 region of the VH and/or VL coding 

regions of said B cells; 

The sequences occurring in total at a frequency of at least 1 percent in the set of sequences 
determined at step (i) may be the candidate CDR3 sequences or a sequence having at least 
80% homology, for example 85, 90, 95, 96, 97, 98 or 99% homology therewith. Sequence 
homology is as determined using the BLAST2 program (Tatusova TA et ah, FEMS 
Microbiol Lett. 1999 May 15; 174(2) :247-50; PMID: 10339815) at the National Center for 
Biotechnology Information, USA (www.ncbi.nlm.nih.gov) with default parameters. 

For example, the sequences occurring in total at a frequency of at least 1 percent in the set 
of sequences determined at step (i) may be the candidate CDR3 sequences or a sequence 
having 1 or 2 amino acid changes therefrom. In general, the frequency of a dominant 
sequence is added together with the frequency of any sequence showing >80% homology, 
such that any sequence exhibiting a frequency of 1 % or greater is then deemed a candidate 
sequence. For example, a first sequence may occur at a frequency of 0.7 percent, and first, 
second, third and fourth sequences each having a single amino acid change there from each 
occur at a frequency of 0.1% - the total occurrence is therefore 1.1% and the dominant 
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antibody sequence (occurring at a frequency of 0.7%) is therefore a candidate CDR3 
sequence. 

This is fundamentally different to prior art methods of making a library of antibodies in a 
5 vector such as a phage library where antibodies are isolated by panning and which require 
binding with a specific (known) antigen in a structurally- and charge-specific state. An 
analogy by which the two can be compared would be to say that the prior art making of a 
library and subsequent panning is like going into a library, picking up a book and hoping 
that it is the right one. In comparison, the present invention is like going into a library, 
10 reading all of the books in the library and finding that one book is dominant - that there are 
many copies of it - and that it is relevant to the disease to be treated. 

In addition, the prior art use of library systems has a number of significant problems - in 
particular, some antibodies produced by a library may cause the death of the organism 

15 expressing them and therefore they simply cannot be detected. This is not a particular 
problem when e.g. looking for antibodies specific to cancers, but when one is searching 
for antibodies specific to an antigen from a pathogen which might be homologous to one 
produced by the host expression system (e.g. Escherichia coli) then important antibodies 
cannot be expressed. The use of e.g. E. coli to express libraries of e.g. human antibodies 

20 also suffers from the problem of codon usage - codons used by humans for specific amino 
acids can frequently not be the optimum ones for the same amino acid in E. coli or other 
host systems. This means that an important antibody might not be expressed (or at least not 
in sufficient quantities) since the codons in its sequence are highly inefficient in E. coli, 
resulting in the E. coli being unable to read through and express it in full. Codon 

25 optimisation of antibody libraries is obviously not an option since the libraries would first 
have to be sequenced, which defeats the main advantages of using libraries. Since the 
present invention sequences antibodies directly from a patient, it avoids this problem. 
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The correlation step (ii) may also correlate the occurrence of the candidate sequences 
against their occurrence in patients who have not been infected with Clostridium difficile 
or administered the vaccine, sequences only being determined to be candidate sequences 
if they or a sequence having 80% homology to them occur in total at a frequency of less 
5 than 1 percent in the non-infected/vaccinated patients. 

The at least one antigen produced by the micro-organism may of course be an immunogen. 

The present invention provides unique opportunities for understanding antibody responses 
10 to infection which were not previously available, and provides novel diagnostic and 
therapeutic opportunities. 

In particular, since the methods of the present invention bypass the need to isolate an 
antigen from C. difficile, the difficulties associated with identifying e.g. complex non- 
15 linear epitopes, epitopes containing carbohydrate, or cryptoantigens are avoided. Instead 
the methods facilitate the determination of the identity of antibodies specific against one 
or more antigens produced by the bacterium. As is explained below, the methods of the 
present invention allow the identification of therapeutically effective antibodies and can 
allow the identification of the most important parts of antibody sequences. This is 
20 particularly evident when the B cells of a given patient are sampled at different time points 
during the course of an infection by C. difficile - as the patient's immune system becomes 
focussed on producing therapeutically effective antibodies against the bacterium, a 
focussing of antibody sequences is observed, and the development of variable and non- 
variable regions within the CDR parts of the VH and/or VL sequences is observed. In 
25 addition, the methods of the present invention can allow the identification of antibodies 
best suited to the treatment of C. difficile infection in particular groups of patients, such 
as age, sex and racial groups. Similarly by comparing the antibody sequences of patients 
who have recovered from infection with C. difficile with those that have not recovered 
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from infection, ineffective antibody sequences (which might be produced by both sets of 
patients) can be identified. 

The B cells isolated from the at least one patient can be peripheral B-cell lymphocytes 
5 (PBLs), and can be isolated from a blood sample from the at least one patient. B-cells can 
also be isolated from other sources and the present invention extends to their use. For 
example, B-cells can be isolated from spleen. 

Antibody CDR sequences are determined as detailed in the "Experiments" section using 
10 standard techniques and definitions of their start and end. 

As is detailed below, the antibody sequences are determined from patients with C. difficile 
infections. These sequences can be derived from B cell immunoglobulin mRNA and hence 
reflect expressed antibodies and repertoires therein. The sequences determined according 
15 to the present invention need not be the sequences of whole antibody molecules - they 
comprise at least the VH and/or VL sequences which specify the regions responsible for 
antigen binding. 

From the nucleotide sequences determined by the initial sequencing, putative amino acid 
20 sequences for the VH and/or VL regions can be determined using standard algorithms and 
software packages (e.g. see www.mrc-lmb.cam.ac.uk/pubseq/, the Staden package and 
Gap4 programs; Rodger Staden, David P. Judge and James K. Bonfield. Managing 
Sequencing Projects in the GAP4 Environment. Introduction to Bioinformatics. A 
Theoretical and Practical Approach. Eds. Stephen A. Krawetz and David D. Womble. 
25 Human Press Inc., Totawa, NJ 075 1 2 (2003); Rodger Staden, David P. Judge and James 
K. Bonfield. Analysing Sequences Using the Staden Package and EMBOSS. Introduction 
to Bioinformatics. A Theoretical and Practical Approach. Eds. Stephen A. Krawetz and 
David D. Womble. Human Press Inc., Totawa, NJ 07512 (2003)). These can be further 
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characterised to determine the CDR (Complementarity Determining Region) parts of the 
VH and/or VL sequences, particularly CDR1, CDR2 and CDR3. Methods for determining 
the putative amino acid sequences and identifying CDR regions are well known and 
detailed below. 

As well as the VH and/or VL amino acid sequences, several pieces of additional 
information can be used in the correlation step: 

(i) the strain of C. difficile causing the patient's infection; 

(ii) the time point during the infection process at which antibody 
sequences were sampled 

(iii) patient details - none or more of: sex, race, and age; and 

(iv) the range of complementarity determining regions (CDR) ie the 
variable regions of the antibody that undergo direct antigen 
binding/contact. 

Thus in a method according to the present invention, B cells can be used which have been 
isolated from the at least one patient at a plurality of time points during infection of the at 
least one patient by C. difficile (or post- vaccination), correlation step (ii) correlating the 
time point during infection of the at least one patient by C. difficile at which the B cells are 
isolated. 

Similarly, in a method according to the present invention, B cells can be used which have 
been isolated from at least two patients, at least one of whom has recovered from infection 
by C. difficile, and at least one of whom has not recovered from infection by C. difficile, 
correlation step (ii) correlating the recovery of the at least two patients from infection by 

C. difficile. 
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Similarly, in a method according to the present invention, B cells can be used which have 
been isolated from at least two patients, the patients being infected by strains of C. difficile 
producing the at least one antigen, correlation step (ii) correlating the sequenced at least 
the VH and/or VL coding regions of the B cells to identify a set of candidate sequences for 
5 antibodies, each of which is specific against at least one shared antigen produced by the 
different strains of C. difficile or is specific against different antigens produced by the 
different strains of C. difficile. 

The sequencing of the VH and/or VL regions can also be used to identify the surrounding 
10 antibody framework, and this can be used to determine the antibody isotype. This can then 
be used in the correlation step to determine whether specific antibody isotypes are 
particularly useful. 

By the term "therapy" is meant any treatment which is designed to cure, alleviate, remove 
15 or lessen the symptoms of, or prevent or reduce the possibility of contracting any disorder 
or malfunction of the human or animal body 

The invention is suitable for identifying antibody sequence useful in treating C. difficile 
infection. 

20 

Using the abovementioned additional information fields in the correlation step (ii), it is 
easy to perform the correlation using e.g. only CDR sequences, CDRs from either a given 
patient or a given infection or both, CDRs from a given site or patient group, or CDRs 
from different sampling times over the course of infection. 

25 

These correlations are CDR-specific because these are the hypervariable regions 
responsible for antigen binding and sequence diversity of the antibody repertoire (although 
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thc framework regions can be used in the correlation step as well to reveal the antibody 
isotype). 

The present invention is useful in a wide range of applications, e.g. antibody therapy, and 
5 vaccine studies. 

Antibody Therapy: 

This involves the passive transfer of antibody to non-immune individuals (e.g. patients 
undergoing chemo-/radio-therapy, immunosuppression for organ transplantation, 
10 immunocompromised due to underlying conditions such as diabetes, trauma etc, also the 
very young or very old). 

The present invention can be used to determine the sequences of antibodies conferring 
immunity by looking for over-represented VH and/or VL sequences in patients who have 

15 overcome infection. These protective antibodies can be re-synthesised at the genetic level, 
over-expressed in E. coli (or other expression systems) and purified. The resultant purified 
recombinant antibody can then be administered to patients as a passive immunotherapy. 
Antibodies can also be ordered from commercial suppliers such as Operon Technologies 
Inc., USA (www.operon.com) by simply supplying them with the sequence of the antibody 

20 to be manufactured. 

Therapeutically useful sequences can be identified in a number of ways which can be used 
in the correlation step (ii): 

25 1 ) By looking for antibody sequence over-representation in patients who 

have recovered from C. difficile infection - B cells that produce 
antibodies capable of pathogen binding undergo clonal expansion, 
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hence high frequency antibody sequences are most likely to bind 
pathogen and confer immunity. 

By following alterations in the antibody repertoire over the course of 
C. difficile infection. During infection, antibodies undergo a 
maturation process to improve pathogen binding and this is 
characterised by sequence alterations. Also, B cell clonal expansion 
is more prevalent in the final stages of infection where the infection 
is cleared. The most frequent antibodies in the repertoire are chosen 
as candidates for immunotherapy. Following the maturation process 
the candidate antibody will demonstrate which key amino acid residue 
alterations improve antigen binding and this information can be used 
to improve antibody design. 

Analysis of the repertoires from different patients with C. difficile 
infection can identify shared and identical protective antibodies. 
These antibodies are attractive choices for immunotherapy as their 
occurrence in different patients suggests a strong positive selection in 
their favour; hence an important role in antibody-based immunity. 

Analysis of the repertoires from patients infected with different strains 
of C. difficile. In this situation, if a common antibody sequence is 
found in the repertoires for both strains, the antibody may be useful 
in treating both strains of the bacterium, ie displaying a broad 
spectrum. 



5) 



Analysis of the repertoires for affinity maturation of sequences. 



WO 2004/094474 



PCT/GB2004/001619 



-15- 

Vaccination Studies: 

Vaccination protects against infection by priming the immune system with 
pathogen-derived antigen(s). Vaccination is effected by a single or repeated exposures to 
the pathogen-derived antigen(s) and allows antibody maturation and B cell clonal 
expansion without the deleterious effects of the full-blown infectious process. T cell 
involvement is also of great importance in effecting vaccination of patients. The present 
invention can be used to monitor the immunisation process with experimental C. difficile 
vaccines. Subjects are given the experimental vaccine and VH and/or VL sequences are 
amplified from the patient and the antibody repertoire analysed as described above. 
Qualitative and quantitative assessment of the vaccination process is possible: 

1) The VH/VL repertoire of a group of patients who have been 
administered an experimental C. difficile vaccine is assessed. A 
precise molecular dissection of the resulting antibody response to the 
vaccine can be performed to determine (a) clonal expansion of 
protective antibodies, (b) protective antibody production in different 
populations (for example differing ethnic groups with different 
genetic backgrounds, as well as e.g. age and sex groups), and (c) the 
long term effect of the vaccine i.e. the antibody response over the long 
term - long term antibody memory in the immune system as well as 
any autoimmune defects; 

2) Where vaccination results in an increased frequency of a given 
antibody, the antibody sequence can easily be cloned and expressed 
and used in animal models of infection. If the antibody is protective, 
it may be useful in itself as an immunotherapy, but this also 
demonstrates that the vaccine is likely to be protective without 
subjecting the human subject to experimental infection. 
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Also provided to the present invention is a method of manufacture of a medicament for the 
treatment of a C. difficile infection which produces at least one antigen, comprising the 
steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least 
one antigen produced by C. difficile; and 

(ii) synthesising at least one antibody comprising a said candidate 
sequence specific against the at least one antigen produced by C. 
difficile. 

Medicaments and methods of treatment according to the present invention will be readily 
apparent to one skilled in the art. Medicaments may be prepared using pharmaceutically 
acceptable carriers, diluents or excipients (Remington's: The Science and Practice of 
Pharmacy (1995)Mack Publishing Company, Easton, PA, USA). The medicaments and 
methods of treatment may be effected using a pharmaceutically effective amount of the 
antibody/antigen-binding fragment. Appropriate dosages will be readily apparent to one 
skilled in the art and may be readily determined, for example by means of dose-response 
experiments 

Also provided according to the present invention is a method of treatment of an infection 
of a patient by C. difficile which produces at least one antigen, comprising the steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least 
one antigen produced by C. difficile; 

(ii) synthesising at least one antibody comprising a said candidate 
sequence specific against the at least one antigen produced by C. 
difficile; and 
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(iii) administering a therapeutically effective quantity of said at least one 
synthesised antibody to said patient. 

Also provided according to the present invention is a method of producing a database 
5 which identifies candidate sequences for antibodies specific against at least one antigen 
produced by C. difficile, comprising the steps of: 

(i) performing a method according to the present invention to identify a 
set of candidate sequences for antibodies specific against the at least 
one antigen produced by C. difficile; and 

10 (ii) storing the data produced by said method in said database. 

Also provided is a method of generating a report which identifies candidate sequences for 
antibodies specific against at least one antigen produced by C. difficile, comprising the 
steps of: 

15 (i) performing a method according to the present invention to identify a 

set of candidate sequences for antibodies specific against the at least 
one antigen produced by C. difficile; and 

(ii) producing a report comprising the data produced by said method of 
step (i). 

20 

Also provided according to the present invention is a method for determining the efficacy 
of a vaccine, comprising the steps of: 

(i) with B cells isolated from at least one patient who has been 
administered said vaccine, sequencing at least the CDR3 region of the 

25 VH and/or VL coding regions of said B cells; and 

(ii) correlating said sequenced at least the CDR3 region of the VH and/or 
VL coding regions of said B cells to identify a set of candidate 
sequences for at least the CDR3 region of antibodies specific against 
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said vaccine, each of said set of CDR3 candidate sequences or a 
sequence having at least 80% homology therewith occurring in total 
at a frequency of at least 1 percent in the set of sequences determined 
at step (i). 

In certain embodiments of the invention, step (i) above may comprise the steps of: 
(i)(a) administering said vaccine to at least one patient; 
(i)(b) isolating B cells from said at least one patient; and 
(i)(c) sequencing at least the CDR3 region of the VH and/or VL coding 
regions of said B cells. 

Correlation step (ii) may comprise correlating said sequenced at least a CDR3 region of 
the VH and/or VL coding regions of said B cells with sequenced at least a CDR3 region 
of the VH and/or VL coding regions of B cells isolated from at least one patient who has 
been infected with C. difficile against which vaccination with said vaccine is intended to 
stimulate a protective immune response. 

As described above, additional information may be used in the correlation of step (ii), 
including: 

(i) the time since administration of the vaccine to the patient; 

(ii) patient details - none or more of: sex, race, and age; and 

(iii) the range of complementarity determining regions (CDR) ie the 
variable regions of the antibody that undergo direct antigen 
binding/contact. 

The antibody sequences determined from patients who have been administered the vaccine 
can be compared with antibody sequences isolated from patients who have been infected 
with C. difficile against which the vaccine is intended to stimulate a protective immune 



WO 2004/094474 



PCT/GB2004/001619 



- 19- 

response in patients. Thus it is possible to determine whether the vaccine results in the 
generation of antibodies which are also produced in response to infection by C. difficile 
itself. In particular, the comparison can be made using antibody sequences determined 
from patients who have recovered from infection by C. difficile. 

5 

In particular the method may determine the efficacy of the vaccine in stimulating a 
protective immune response against C. difficile against which vaccination with the vaccine 
is intended to stimulate a protective immune response. 

10 Also provided is a method of producing a database which identifies the efficacy of a 
vaccine, comprising the steps of: 

(i) performing a method according to the present invention to determine 
the efficacy of said vaccine; and 

(ii) storing the data produced by said method in said database. 

15 

Also provided according to the present invention is a method of generating a report which 
identifies the efficacy of a vaccine, comprising the steps of: 

(i) performing a method according to the present invention to determine 
the efficacy of said vaccine; and 
20 (ii) producing a report comprising the data produced by said method. 

Also provided according to the present invention is a diagnostic test method for identifying 
a Clostridium difficile infection in a patient, comprising the steps of: 

(i) with B cells isolated from said patient, sequencing at least the CDR3 
25 region of the VI I and/or VL coding regions of said B cells; 

(ii) comparing said sequenced at least said CDR3 region of the VH and/or 
VL coding regions of said B cells with a set of sequences for at least 
the CDR3 region of antibodies specific against Clostridium difficile, 
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and determining whether each of said set of CDR3 sequences or a 
sequence having at least 80% homology therewith occurs in total at 
a frequency of at least 1 percent in the set of sequences determined at 
step (i); and 

(iii) correlating the results of comparison step (ii) to determine the 
presence or absence of a Clostridium difficile infection in said patient. 

Also provided according to the present invention is a diagnostic test method for 
determining the susceptibility of a patient to Clostridium difficile infection, comprising the 
steps of: 

(i) with B cells isolated from said patient, sequencing at least the CDR3 
region of the VH and/or VL coding regions of said B cells; 

(ii) comparing said sequenced at least said CDR3 region of the VH and/or 
VL coding regions of said B cells with a set of sequences for at least 
the CDR3 region of antibodies specific against Clostridium difficile, 
and determining whether each of said set of CDR3 sequences or a 
sequence having at least 80% homology therewith occurs in total at 
a frequency of at least 1 percent in the set of sequences determined at 
step (i); and 

(iii) correlating the results of comparison step (ii) to determine the 
susceptibility of said patient to Clostridium difficile infection. 

The diagnostic test method could be used to sequence the CDR3 sequences of patients in 
for example a hospital ward, to determine which patients were susceptible to infection by 
the bacterium. The presence of C. difficile specific antibodies in a sample (as determined 
by sequencing of CDR3 regions) would likely indicate that such patients were capable of 
mounting a protective immune response toward the bacterium and would not require 
antibiotic treatment. In contrast, those patients identified who had not produced C. difficile 
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specific antibodies could be identified as being immunocompromised and therefore 
candidates for antibiotic treatment during the course of a C. difficile infection. 

Also provided according to the present invention is a diagnostic test kit for performing a 
5 diagnostic test method according to the present invention. The kit may contain reagents 
(e.g. PGR primers) which would enable a person skilled in the art to identify CDR3 
sequences from a patient. 

As regards antibodies usable in the present invention, particularly synthetic antibodies (the 
10 term "antibody" is also considered to incorporate antigen binding fragments of antibodies 
unless the context otherwise requires), the antibody may be a whole antibody or an antigen 
binding fragment thereof and may in general belong to any immunoglobulin class. Thus, 
for example, it may be an IgM, IgG or an IgA antibody. The antibody or fragment may be 
of animal, for example, mammalian origin and may be for example of murine, rat, sheep 
15 or human origin. It may be a natural antibody or a fragment thereof, or, if desired, a 
recombinant antibody fragment, i.e. an antibody or antibody fragment which has been 
produced using recombinant DNA techniques. 

Particular recombinant antibodies or antibody fragments include, (1) those having an 
20 antigen binding site at least part of which is derived from a different antibody, for example 
those in which the hypervariable or complementarity determining regions of one antibody 
have been grafted into the variable framework regions of a second, different antibody (as 
described in, for example, EP 239400); (2) recombinant antibodies or fragments wherein 
non-Fv sequences have been substituted by non-Fv sequences from other, different 
25 antibodies (as described in, for example, EP 171496, EP 173494 and EP 194276); or (3) 
recombinant antibodies or fragments possessing substantially the structure of a natural 
immunoglobulin but wherein the hinge region has a different number of cysteine residues 
from that found in the natural immunoglobulin but wherein one or more cysteine residues 
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in a surface pocket of the recombinant antibody or fragment is in the place of another 
amino acid residue present in the natural immunoglobulin (as described in, for example, 
WO 89/01782 and WO 89/01974). 

5 Teachings of texts such as Harlow, E. and Lane, D. ("Using Antibodies: A Laboratory 
Manual", Cold Spring Harbor Laboratory Press, New York, 1998) further details 
antibodies, antibody fragments, their preparation and use. 

The antibody or antibody fragment may be of polyclonal or monoclonal origin. It may be 
10 specific for at least one epitope. 

Antigen binding antibody fragments include, for example, fragments derived by proteolytic 
cleavage of a whole antibody, such as F(ab')2, Fab' or Fab fragments, or fragments 
obtained by recombinant DNA techniques, for example Fv fragments (as described, for 
15 example, in WO 89/02465). 

Where it is desired to produce recombinant antibodies according to the invention these may 
be produced using, for example, the methods described in EP 171469, EP 173494, EP 
194276 and EP 239400. Antibody fragments may be produced using conventional 
20 techniques, for example, by enzymatic digestion with pepsin or papain. 

Antibodies according to the invention may be labelled with a detectable label or may be 
conjugated with an effector molecule, for example a drug eg. an antibacterial agent or a 
toxin or an enzyme, using conventional procedures and the invention extends to such 
25 labelled antibodies or antibody conjugates. 

The contents of each of the references discussed herein, including the references cited 
therein, are herein incorporated by reference in their entirety. 
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Where "PMID:" reference numbers are given for publications, these are the PubMed 
identification numbers allocated to them by the US National Library of Medicine, from 
which full bibliographic information and abstract for each publication is available at 
www.ncbi.nlm.nih.gov. 

The invention will be further apparent from the following description, with reference to 
the several figures of the accompanying drawings, which show, by way of example only, 
forms of identifying candidate sequences for antibodies specific against at least one 
antigen and of determining the efficacy of a vaccine. 

Of the Figures: 

Figure 1 shows the general principles for the isolation and DNA 
sequencing of VH and/or VL antibody gene fragments from B 
cells. Reference numeral 10 indicates primary PCR, reference 
numeral 20 the cloning into a DNA sequencing vector using 
the T-tailing principle - random orientation, and reference 
number 30 the determining of the nucleotide sequence of 
forward and reverse strands using Ml 3 forward and reverse 
primers; and 

Figure 2 shows a schematic depiction of resynthesised recombinant 
antibody gene cassette. VH and VL regions are linked with a 
glycine serine-rich linker. Each variable domain contains three 
Complementarity Determining Regions (CDRs) which 
participate in antigen binding. 
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EXPERIMEWTS 

The experiments below detail methods for identifying antibody VH and/or VL CDR 
sequences that confer immunity in patients with C. difficile infection. These sequences can 
5 then be used to produce synthetic antibodies and these synthetic antibodies are suitable for 
administration to patients for therapy. Also detailed are the VH and VL CDR3 sequences 
identified by the methods. 

Also detailed are methods of determining the efficacy of a vaccine in generating a desired 
10 immune response in a patient. 

Unless stated otherwise, all procedures were performed using standard protocols and 
following manufacturer's instructions where applicable. Standard protocols for various 
techniques including PCR, molecular cloning, manipulation and sequencing, the 

15 manufacture of antibodies, epitope mapping and mimotope design, cell culturing and 
phage display, are described in texts such as McPherson, M.J. et al. (1991, PCR: A 
practical approach, Oxford University Press, Oxford), Sambrook, J. et al. (1989, Molecular 
cloning: a laboratory manual, Cold Spring Harbour Laboratory, New York), Huynh and 
Davies (1985, "DNA Cloning Vol I - A Practical Approach", IRL Press, Oxford, Ed. D.M. 

20 Glover), Sanger, F. et al. (1977, PNAS USA 74(12): 5463-5467), Harlow, E. and Lane, 
D. ("Using Antibodies: A Laboratory Manual", Cold Spring Harbor Laboratory Press, New 
York, 1998), Jung, G. and Beck- Sickinger, A.G. (1992, Angew. Chem. Int. Ed. Eng., 31: 
367-486), Harris, M.A. and Rae, I.F. ("General Techniques of Cell Culture", 1997, 
Cambridge University Press, ISBN 0521 573645), "Phage Display of Peptides and 

25 Proteins: A Laboratory Manual" (Eds. Kay, B.K., Winter, J., and McCafferty, J., Academic 
Press Inc., 1996, ISBN 0-12-402380-0). 
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Reagents and equipment useful in, amongst others, the methods detailed herein are 
available from the likes of Amersham (www.amersham.co.uk), Boehringer Mannheim 
(www.boehringer-ingeltheim.com), Clontech (www.clontech.com), Genosys 
(www.genosys.com), Millipore (www.millipore.com), Novagen (www.novagen.com), 
5 Perkin Elmer (www.perkinelmer.com), Pharmacia (www.pharmacia.com), Promega 
(www.promega.com), Qiagen (www.qiagen.com), Sigma (www.sigma-aldrich.com) and 
Stratagene (www.stratagene.com). 

The term "antibody" in its various grammatical forms is used herein to refer to 
10 immunoglobulin molecules and immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antibody combining site or paratope. Such 
molecules are also referred to as "antigen binding fragments" of immunoglobulin 
molecules. 

15 Illustrative antibody molecules are intact immunoglobulin molecules, substantially intact 
immunoglobulin molecules and those portions of an immunoglobulin molecule that contain 
the paratope, including those portions known in the art as Fab, Fab', F(ab')2, scFv and F(v). 

The term "antibody combining site" refers to that structural portion of an antibody 
20 molecule comprised of a heavy and light chain variable and hypervariable regions that 
specifically binds (immunoreacts with) antigen. 

The identification and sequencing of B cells producing antibodies and the analysis of the 
resulting data comprises the following basic steps: 
25 (1) Isolation of VH and/or VL coding regions from circulatory B cells of 

human patients. 

(2) Determining the Nucleotide sequence of VH and/or VL repertoires. 

(3) Determination of VH and/or VL primary amino acid sequences. 
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(4) Extraction of CDR regions in silico - incorporation into the database 

(5) Detection of dominant CDR & framework regions in VH and/or VL 
repertoire. 

(6) Construction & Production of therapeutic recombinant antibodies 
from dominant antibody sequences. 

1. Isolation of VH and/or VL coding regions from circulatory B cells of human 
patients 

For this method, human patients with C. difficile infection were selected as donors of 
immunised B cells. The criteria for selection were: 

(a) The patient must exhibit a pronounced antibody response to C. difficile, 
detectable for example by Western blotting. Samples of antibodies were collected from a 
patient blood sample. Samples were diluted and tested for immunoreactivity by Western 
blotting using antigen(s) derived from C. difficile. In such an assay, patients exhibiting 
strong antibody responses showed pronounced recognition/antibody binding of C. difficile 
antigens as detected using anti-human polyclonal detection antibodies. 

(b) The patient has survived the course of C. difficile infection - this improves 
the chance of generating B cell responses for antibodies capable of neutralizing the 
bacterium. 

Peripheral B-cell lymphocytes (PBLs) were collected from infected patient blood samples. 
For this heparinised blood was diluted in PBS (20 ml total) and overlaid onto a 15 ml 
cushion of Ficol Hypaque (Pharmacia; unless stated otherwise, all chemicals and culture 
media were purchased from Sigma, UK) in a 30 ml centrifuge tube. The PBLs were then 
collected by centrifugation (400 x g, 5 minutes) and washed in PBS and harvested by 
centrifugation again. RNA was prepared from PBLs using the QuickPrep mRNA 
purification kit (Pharmacia) exactly according to the manufacturers instructions. The 
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isolated mRNA was used to prepare cDNA via a reverse transcriptase reaction (Promega 
cDNA synthesis kit). For this 2ug of mRNA was re-suspended in 16 uL nuclease-free 
water and heated to 65 °C for 3 minutes (to denature secondary structure) and then 
immediately chilled on ice for 1 minute. The mRNA was then added to the following 
5 cocktail: 8 \xL 25 mM MgCl 2 , 4 \xL dNTP mix (10 mM with respect to each ribonucleotide 
triphosphate), 1 u.L RNAsin 40 u ul/ 1 stock solution, 1.2 uL AMV reverse transcriptase 
(25 u uL" 1 stock solution), 6 \xL cDNA 10 pmol u"L primer (see Figure 1 - cDNA 
synthesis). The mixture was incubated at 42 °C for 1 hour and then incubated at 100 °C 
for 3 minutes to stop the reaction. 

10 

DNA coding for antigen binding regions was then amplified by PGR using the cDNA 
prepared from patients PBLs. For this, the cDNA was used in the following PCR reaction 
to produce either heavy chain or light chain-derived antibody variable region DNA (see 
Figure 1): 2.5 |aL cDNA, 33 uL water, 4 uL dNTP mix (25 mM with respect to each 

15 deoxynucleotide triphosphate), 5 uL Taq reaction buffer (Perkin Elmer), 2.5 uL of an 
equimolar primer mix (final concentration of 20 pmol with respect to each primer) and 0.5 
u.L Taq DNA polymerase ( 1 u uL ~\ Perkin Elmer Corp). The forward and reverse primers 
used in these reactions are depicted in Figure 1, and their respective nucleotide sequences 
are listed in Table 1. PCR reaction conditions were 94 °C for 1 minute, 57 °C for 1 minute 

20 and 72 °C for 2 minutes for a total of 30 cycles, with an extended denaturation (94 °C for 
5 minutes) prior to cycle 1 and an additional extension step after the end of cycle 30 (72 
°C for 10 minutes). PCR was performed using a Perkin Elmer 9700 GeneAmp PCR 
machine. 5 u.L of the PCR reaction was run on a 1% agarose gel to check the amplification 
of the expected 393 base pair (bp) product. The remaining product was run on a 0.8% low 

25 melting point (LMP) agarose gel and the 393 bp band was excised using a clean scalpel 
blade. DNA was extracted from the agarose gel slice using a GeneClean II Kit (Anachem, 
Luton, UK). 
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The PGR resulted in two fragment types -derived from VH and VL regions respectively, 
depending on the PGR primer sets used (for details of primers sets for antibody genes and 
a PGR schematic see Table 1 and Figure 1, respectively). 



5 Table 1 : Primers used in generating and sequencing VH and VL gene fragments. All 
sequences are given in the 5' to 3' direction. Use of primers is described above and in 
Figure 1. 



Primers used in cDNA synthesis reactions SEQ ID NO: 

HuIgGl-4F0R GTC CAC CTT GGT GTT GCT GGG CTT 01 

HuCLFOR AGA CTC TCC CCT GTT GAA GCT CTT 02 



25 



Primers for primary PGR 
















SEQ ID NO: 


HuJHl - 2 FOR 


TGA 


GGA 


GAC 


GGT 


GAC 


CAG 


GGT 


GCC 


03 


HU JH3 FOR 


TGA 


AGA 


GAC 


GGT 


GAC 


CAT 


TGT 


CCC 


04 


HuJH4 -5 FOR 


TGA 


GGA 


GAC 


GGT 


GAC 


CAG 


GGT 


TCC 


05 


HUJH6FOR 


TGA 


GGA 


GAC 


GGT 


GAC 


CGT 


GGT 


CCC 


06 


HuVHlaBACK 


CAG 


GTG 


CAG 


CTG 


GTG 


CAG 


TCT 


GG 


07 


HuVH2bBACK 


CAG 


GTG 


AAC 


TTA 


AGG 


GAG 


TCT 


GG 


08 


HuVH3 aBACK 


CAG 


GTG 


CAG 


CTG 


GTG 


GAG 


TCT 


GG 


09 


HUVH4 aBACK 


CAG 


GTG 


CAG 


CTG 


CAG 


GAG 


TCG 


GG 


10 


HuVH4bBACK 


CAG 


GTG 


CAG 


CTA 


CAG 


CAG 


TGG 


GG 


11 


HUVH5 aBACK 


GAG 


GTG 


CAG 


CTG 


TTG 


CAG 


TCT 


GC 


12 


HuVH6 aBACK 


CAG 


GTA 


CAG 


CTG 


CAG 


CAG 


TCA 


GG 


13 


HuJKlFOR 


ACG 


TTT 


GAT 


TTC 


CAC 


CTT 


GGT 


CCC 


14 


Hu JK2 FOR 


ACG 


TTT 


GAT 


CTC 


CAG 


CTT 


GGT 


CCC 


15 


Hu JK3 FOR 


ACG 


TTT 


GAT 


ATC 


CAC 


TTT 


GGT 


CCC 


16 


Hu JK4 FOR 


ACG 


TTT 


GAT 


CTC 


CAC 


CTT 


GGT 


CCC 


17 


Hu JK5 FOR 


ACG 


TTT 


ATT 


CTC 


CAG 


TCG 


TGT 


CCC 


18 


HuVKlBACK 


GAC 


ATC 


CAG 


ATG 


ACC 


CAG 


TCT 


CC 


19 


Hu VK2 B AC K 


GAT 


GTT 


GTG 


ATG 


ACT 


CAG 


TCT 


CC 


20 


HuVK3 BACK 


GAA 


ATT 


GTG 


TTG 


ACG 


CAG 


TCT 


CC 


21 


HuVK4 B AC K 


GAC 


ATC 


GTG 


ATG 


ACC 


CAG 


TCT 


CC 


22 


HUVK5BACK 


GAA 


ACT 


ACA 


CTC 


ACG 


CAG 


TCT 


CC 


23 


HuVKSBACK 


GAA 


ATT 


GTG 


CTG 


ACT 


CAG 


TCT 


CC 


24 


Primers for DNA 


Sequencing 
















Ml 3 forward 


GTA 


AAA 


CGA 


CGG 


CCA 


GT 






25 


M13 reverse 


AAC 


AGC 


TAT 


GAC 


CAT 


G 






26 



40 
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VH and/or VL gene fragments were cloned into cloning vector pGEM-T easy (Promega 
Corporation) to facilitate DNA sequencing. For this 3 u.g PGR product was prepared for 
restriction using QIAquick PGR purification spin columns (Qiagen, UK) according to the 
manufacturers instructions. DNA was eluted from the spin column in 40 uL buffer EB. 
5 Purified PGR product (2 uL) was mixed with 1 uL pGEM-T easy vector, 6 uL water and 
1 uL DNA ligase and the mixture ligated for 1 h at room temperature. Ligations were then 
transformed into electrocompetent E. coli TGI cells (Stratagene) by electroporation, and 
plated out onto agar plates containing Ampicillin 100 ug ml* 1 IPTG (100 uM) and X-gal 
(0.006 % w/v). Colonies were allowed to grow overnight at 37 °C and then stored at 4 °C. 
10 Recombinant colonies are identified as white colonies on this media. 

2. Determining the nucleotide sequence of VH and/or VL repertoires 

DNA was first prepared from VH and/or VL recombinant E. coli clones. For this, 
individual colonies were each transferred to 1.2 ml LB broth supplemented with 

15 Ampicillin 100 ug ml" 1 using a 96 well plate format. Cultures were the grown at 37 °C for 
24 hours. Bacterial cells were harvested by centrifugation at 4,000 x g, 30 minutes and the 
supernatants discarded. Plasmid DNA was prepared using Wizard SV 96 plasmid 
purification kits (Promega Corporation) essentially following the manufacturer's 
instructions. Yields of plasmid DNA were typically in the order of 5 ug per 1.2 ml starter 

20 culture. 

DNA sequencing reactions were performed using the DYEnamic ET dye terminator cycle 
sequencing kit (Amersham Pharmacia Biotech). Purified plasmid DNA (0.5 ug) was mixed 
with 8 uL DYEnamic ET terminator reagent premix and 1 uL M13 forward or reverse 
25 primer (5 uM) in a total reaction volume of 20 uL. Thermal cycling was then performed 
using a GeneAmp PGR system 9700 (Perkin Elmer) with the following parameters: 95 °C, 
20 seconds; 50 °C, 15 seconds; 60 °C, 1 minute; 30 cycles). Reactions were performed 
using 96 well format non-skirted ELISA plates (AB Gene). Unincorporated dye terminator 
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were removed using precipitation. For this, ethanol samples were mixed with 2 uL 7.5 M 
ammonium acetate and 55 uL of 100 % ethanol and centrifuged at 3000 g for 30 minutes. 
DNA pellets were washed with 70 % ethanol and re-suspended in 20 uL loading solution. 
Reactions were sequenced using a MegaBACE 1000 DNA sequencer (Amersham 
5 Pharmacia Biotech) following the manufacturers instructions (2 kV injection voltage for 
30 s with electrophoresis at 6 kV for 200 minutes). Chromatograms were exported using 
the .scf file format for finishing and archiving. 

3. Determination of VH and/or VL primary amino acid sequences 

10 DNA sequences of VH and/or VL were determined in both forward and reverse strands 
(using Ml 3 forward and reverse primers respectively) and compared in order to highlight 
discrepancies and maintain a high degree of accuracy. This was done using the Staden 
suite of programs (Staden, R. (1996) The Staden Sequence Analysis Package. Molecular 
Biotechnology 5, 233-241). First, sequence .scf files were entered into the PREGAP 

15 program where vector sequence was stripped off and the quality of the sequence was 
assessed. Poor quality sequences where the DNA sequence was ambiguous were rejected 
and re-sequenced. Forward and reverse strands were matched using the GAP program to 
highlight and resolve areas containing sequencing artefacts. The orientation of the VH 
and/or VL sequence was noted and reverse complemented where necessary to produce only 

20 forward reading frame orientations for translation. VH and VL gene sequences were then 
translated into amino acid sequences. Each sequence was given a unique identifier (name). 

4. Extraction of CDR regions in silico - incorporation into the database 

A general teaching of identifying CDR regions is at www.bioinf.org.uk/abs/ 

25 



The following set of rules will allow the definition of the CDRs in an antibody sequence. 
There are rare examples where these virtually constant features do not occur (for example 
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the human heavy chain sequence EU does not have Trp-Gly after CDR-H3). The Cys 
residues are the best-conserved feature. 



CDR-L1 
Start 

Residue before 
Residue after 

Length 



Approx. residue 24 
always a Cys 

always a Trp. Typically Trp-Tyr-Gln, but also, Trp-Leu-Gln, 
Trp-Phe-Gln, Trp-Tyr-Leu 
10 to 17 residues 



CDR-L2 

Start 

Residues before 
Length 



always 16 residues after the end of CDR-L1 
generally Ile-Tyr, but also, Val-Tyr, Ile-Lys, Ile-Phe 
always 7 residues 



CDR-L3 

Start 

Residue before 
Residues after 
Length 



always 33 residues after end of CDR-L2 
always Cys 

always Phe-Gly-Xaa-Gly (SEQ ID NO: 92) 
7 to 1 1 residues 



CDR-H1 

Start 

Residues before 
25 Residues after 
Length 



Approx. residue 26 (always 4 after a Cys) 

always Cys-Xaa-Xaa-Xaa (SEQ ID NO: 93) 

always a Trp. Typically Trp-Val, but also, Trp-Ile, Trp-Ala 

10 to 12 residues 



CDR-H2 
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Start always 15 residues after the end of CDR-H1 

Residues before typically Leu-Glu-Trp-Ile-Gly (SEQ ID NO: 94), but a number of 
variations 

Residues after Lys/Arg-Leu/Ile/Val/Phe/Thr/Ala-Tlu/SerTle/Ala 
Length 16 to 19 residues; 



CDR-H3 

Start always 33 residues after end of CDR-H2 (always 2 after a Cys) 

Residues before always Cys-Xaa-Xaa (typically Cys-Ala-Arg) 

10 Residues after always Trp-Gly-Xaa-Gly (SEQ ID NO: 95) 

Length 3 to 25 residues 



5. Detection of dominant CDR & framework regions in VH and/or VL repertoire 

The database was constructed using SQL Server Database software (Microsoft). This 
15 allowed the database to be queried using SQL and allowed CDR1, CDR2 and CDR3 
sequences to be extracted from any range of database VH and/or VL sequences in FASTA 
format. Extracted CDRs were then subject to multiple alignment using CLUSTAL X in 
such a way so that identical or very similar CDRs are grouped together in blocks 
(Thompson, J.D., Higgins, D.G. and Gibson, TJ. (1994) CLUSTAL W: improving the 
20 sensitivity of progressive multiple sequence alignment through sequence weighting, 
positions-specific gap penalties and weight matrix choice. Nucleic Acids Research, 
22:4673-4680). From this the frequency of recurring CDRs was determined and CDR- 
dominance in the immunoglobulin repertoire was established. Alternatively, a graphical 
interface was employed using Probiosys (proBionic EMEA, Amsterdam). ProBiosys 
25 prepares dendrograms from CLUSTAL .phy files and allows a rapid visual interpretation 
of the relationships among large numbers of aligned sequences. The whole process was 
performed for CDRs from the same patient sampled at different points during an illness, 
for CDRs from patients with different infections, or from sex-matched, age-matched or 
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race-matched patient groups, depending on the search and the additional information added 
to the database entries. 

5 

6. Construction & production of therapeutic recombinant antibodies from dominant 
antibody sequences 

Once dominant antibody sequences have been recognised in a given repertoire, the 
information can be used to infer the presence of CDRs and frameworks that confer 

10 immunity. Selected VH and/or VL sequences can be identified and their gene sequences 
resynthesised using synthetic oligonucleotides. This is important as it also allows the 
human-derived gene sequence to be codon-optimised for E. coli in order to improve 
protein expression. Dominant VH and/or VL sequences can be spliced together using a 
spacer (linker) sequence (Figure 2). This gene cassette, termed a scFv, can he resynthesised 

15 to include terminal Ndel and Notl restriction sites for cloning into expression vector 
pET29b (Novagen). ScFv DNA can be cut with Ndel (cuts in VH) or Notl (cuts in VL) 
using the following reaction: 40 ul DNA (4 ug), 10 ul of Restriction enzyme buffer D 
(Promega), 47 ul water and 2 ul Ndel and Notl enzyme (10 u ul" 1 ; Promega) with 
digestion for 4 h at 37 °C. DNA can be then fractionated on 0.7% agarose TAE gels and 

20 the digested DNA excised from the gel and purified from the agarose slice using the 
Geneclean II kit (Bio 101) exactly according to the manufacturers instruction. pET29b 
vector DNA can be cut with Ndel and Notl as described above. 1 ug vector can be mixed 
with 1 ug restricted VL DNA resuspended in 8.5 ul water and ligated by addition of 1 ul 
10 x ligation buffer (Boehringer Mannheim) and 0.5 ul DNA ligase (3 u ul" 1 ; Boehringer 

25 Mannheim), followed by ligation overnight at 14 °C. 

The entire ligation can be transformed into E. coli TG2 cells (Stratagene, UK) by 
electroporation. Transformants can be plated out onto agar plates containing Tetracycline 
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(25 ug ml" 1 - selective for recA ::TnlO on the chromosome of TG2) and Kanamycin (50 
ug ml" 1 - selective for pET29). Recombinant plasmid DNA prepared from individual 
clones can be checked for scFv sequence by digestion with Ndel and Noil as described 
above. 

5 

Recombinant scFvs can be over-expressed in E. coli strain JM109(DE3) and purified thus 
enabling biochemical and biological characterisation. Recombinants can be spread on LB 
plates supplemented with 50 ug ml" 1 kanamycin and a single colony can be used to 
inoculate a 10 microlitre starter culture of LB broth with 50 ug ml" 1 kanamycin. For this, 

10 a 1 L expression culture can be prepared in the presence of kanamycin for 4-5 h at 37 °C 
with shaking at 300 rpm. At OD 600 =1, induction can be performed using IPTG and the 
cells can be grown with vigorous aeration for a further 3-5 h. Cells can then be harvested 
by centrifugation (4000 x g, 15 minutes, 4 °C). The cell pellet can be resuspended in 10 
ml fresh (4 °C) lysis buffer (below) and stored at -20 0 C overnight prior to cell breakage 

15 using the 25 ml X-Press system (AB Biox) 



LYSIS BUFFER 

50mMTris HC1 pH 8.0 
1 mM EDTA 
20 100 mM KC1 

0. 1 mM AEBSF (amino ethyl benzene sulphonic acid) 



The lysate can be centrifuged in an Oakridge tube (24 300 x g, 4 °C, 30 minutes). The 
pellet can be resuspended in 20 ml of ice cold lysis buffer on ice using a Silverson lab 
25 blender. The lysate can be split into 8 x Oakridge tubes and each aliquot can be diluted to 
30 ml (total 120 ml) in ice-cold Lysis Buffer. Inclusion bodies can be pelleted (24 300 x 
g, 4 °C, 30 minutes) and the supernatant discarded. Each of the 8 pellets can be 
resuspended in 30 ml of lysis buffer, and the inclusion bodies can be harvested by 
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centrifugation again (24 300 x g, 4 °C, 30 minutes). This wash/centrifugation step can be 
performed five times in total to clean the inclusion body fraction. Each pellet can be 
resuspended in 15 ml of ice-cold water and the inclusion bodies can be stored at -20 °C. 

5 For refolding, 200 ml solution of 2% (w/v) N-lauryl sarcosine (NLS) in 50 mM Tris HC1 
pH 9.0 can be prepared as follows. 4 g NLS can be solubilised in 100 ml water with 
stirring. 10 ml of 1M Tris pH 9.0 stock solution can be added and made up to 195 ml with 
water. 5 ml of inclusion body slurry can be added and stirred vigorously for 30 minutes at 
room temperature. 

10 

CuCl 2 can be added to a concentration of 100 \xM. This serves as a catalyst for oxidation. 
The refolding reaction can be transferred to 4 °C and stirred vigorously for 2 days to 
promote aeration. The refolding reaction can be vacuum filtered through a 0.44 |liM 
vacucup bottle top filter unit (90 mm diameter, Gellman Sciences) and the filtrate 

15 transferred to a Pellicon Labscale TFF system fitted with PLGC10 membrane unit 
(Millipore). The reaction can be concentrated to 25 ml using tangential flow, discarding 
the permeate (the scFv is localised to the retentate). The solution can be diafiltered against 
40 x turn-over volumes (1 L) of 10 mM ammonium acetate (AAT) pH 9.0. Finally, the 
volume of the antibody can be diluted to 50 ml using 10 mM AAT pH 9.0. The buffer 

20 exchanged antibody can be stored for 2 hours at 4 °C. The typical protein content was 1 - 
2 mg ml" 1 with a yield of up to 50 mg per litre. 

For scFv purification a 10 ml bed volume of Ni NTA superflow agarose Qiagen can be 
prepared according to the manufacturers instructions using a 10 ml glass column (Sigma) 
25 without flow adaptors. The column can be equilibrated with 50 ml Buffer B (6M urea, 0. 1 
M NaH 2 P0 4 , 10 mM Tris HC1 pH 8.0). Refolded scFv (as described above)can be diluted 
1/5 in Buffer B and applied directly to the column at ml min" 1 . 50 ml buffer B can be 
applied to the column (flow rate 5 ml min" 1 ) followed by 50 ml Buffer C (same 
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composition as buffer B but pH 6.3). The purified scFv can be eluted from the column by 
slowly applying Buffer B supplemented with 250 mM high grade imidazole until the A 280 
< 0.05. 

5 Results 

The variable heavy chain (VH) and variable light chains of the antibody genes from 
peripheral blood lymphocytes (PBLs) isolated from C. difficile infected patients have been 
TA cloned and sequenced. In each case the third complementarity determining region (i.e. 
CDR-H3 or CDR-L3) was identified. This region is both the most variable region of the 
10 VH chain and the area identified as being most important in determining antigen binding, 
for this reason this area is used as the signature for each VH chain. 

Clinical Information 

Patient D01 - Variable heavy chains (VH's) and variable light chains (VL's) were 
15 sequenced from PBLs taken ten days after the onset of CDAD. Lab results confirmed C. 
difficile toxin was present. 

Patient D02 - VH's and VL's were sequenced from PBLs taken ten days after the onset of 
CDAD. Lab results confirmed C. difficile toxin was present. In the seven subsequent 
20 weeks after sample was taken, the patient did not develop any further C. difficile 
infections, suggesting that the antibodies produced in response to the infection were 
protective. 

Patient D03 - VH's and VL's were sequenced from PBLs taken ten days after the onset of 
25 CDAD. Lab results confirmed C. difficile toxin was present. In the three subsequent weeks 
after sample was taken, the patient did not develop any further C. difficile infections, 
suggesting that the antibodies produced in response to the infection were protective. 
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Seqiiemcing results 

Patient D01 showed a high degree of focusing with both VH's and VL's. Three different 
CDR3 sequences make-up over 80 % of the sequenced VH's. VH's with the CDR3 
sequence of SEQ ID NO: 27 account for 57.8 % of sequenced VH's from this patient. 
5 VH's with the CDR3 sequence of SEQ ID NO: 28 account for 13.5 % of all VH's, and 
VH's the CDR3 sequence of SEQ ID NO: 29 account for 10 % of all sequenced VH's in 
this patient. Three different CDR3 sequences also make up the majority of sequenced 
VL's, these are the CDR3 sequences of SEQ ID NOs: 32, 34, and 33 (occurring at 
frequencies of 32.7 %, 15.3%, and 6.6%, respectively). 

10 

Similarity searches were performed using these common VH and/or VL sequences against 
other VH and/or VL sequences contained within a database (FABTEC database), which 
contains VH and/or VL regions of PBLs isolated from patients with methicillin-resistant 
Staphylococcus aureus (MRSA), Candida albicans, Pseudomonas aeruginosa (the 

15 causative agent of cystic fibrosis), Streptococcus oralis and vancomycin-resistant 
Enterococci (VRE) infections and to identify antibody sequences which are associated with 
resistance to the infections and which can therefore be used to effect therapy against those 
infections. These sequences have formed the basis of WO 01/76627. For all of these, the 
isolated VH and/or VL sequences demonstrate skewing of the antibody repertoire over the 

20 course of infection, thus revealing the identity of matured, immunity-conferring VH and/or 
VL sequences. This has typically been done using 5,000 to 15,000 sequences for each VH 
and/or VL. VH and/or VL repertoires from healthy individuals can also be used for 
comparison purposes. 

25 Using the BLOSUM50 matrix it was possible to pick out groups of antibodies with 
CDR3's that had greater than 70% similarity. The CDR3 sequences of SEQ ID NOs: 27 
and 28 (Table 2 and 3, respectively), showed sequence similarity of greater than 70% only 
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with other VH's from C. difficile infected patients VH's sequenced from individuals with 
other infections (C. albicans, MRSA and VRE) showed no similar CDR3 sequences. 

In the following Tables, libraries starting D are derived from C. difficile infected patients, 
5 libraries starting M are derived from MRSA infected patients, libraries starting V are 
derived from VRE infected patients; and libraries starting C are derived from C. albicans 
infected patients. 



Table 2 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


27 


100.0 


D01 


184 


27 


100.0 


D03 


5 


35 


100.0 


D01 


1 


36 


100.0 


D01 


3 ! 


37 


100.0 


D01 


1 


37 


100.0 


D03 


1 


38 


93.75 


D01 


1 



Table 3 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


28 


100.0 


D01 


43 


28 


100.0 


D03 


1 


39 


100.0 


D01 


1 


40 


88.88 


D01 


1 



25 

VH's with the CDR3 sequence of SEQ ID NO: 29 showed similarity to VH's from 
other C. difficile infected patients and MRSA infected patients (Table 4). 
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CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


41 


100.0 


D01 


1 


29 


100.0 


D01 


32 


29 


100.0 


M01 


1 


29 


100.0 


M02 


5 


29 


100.0 


M03 


2 


29 


100.0 


M04 


8 


42 


100.0 


M03 




43 ; 


100.0 


D01 




44 


93.75 


D01 




45 


93.75 


D01 




46 


75.0 


M04 




47 


70.58 


M05 





15 

The CDR3 sequences from the VL's of patient D01 have also had a similarity search 
performed on them. VL's with the CDR3 sequences of SEQ ID NOs: 32 and 33 (Tables 
5 and 6, respectively) show a high degree of homology with CDR3 sequences from C. 
difficile infected patients but no homology to CDR3 sequences derived from patients with 
20 different clinical infections. VL's with the CDR3 sequence of SEQ ID NO: 34 showed 
similarity to VL's isolated from C. difficile, MRSA and VRE infected patients (Table 7). 
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Table 5 



15 



CDR3 SEQIDNO: 


Similarity 


Library 


No. of clones 


32 


100.0 


D51 


64 


48 


90.9 


D52 


2 


49 


90.9 


D52 


1 


50 


90.9 


D52 


1 


51 


90.0 


D51 


1 


52 


90.0 


D52 




53 


81.8 


D52 




54 


81.8 


D52 




55 


81.8 


D52 




56 


81.8 


D52 




57 


80.0 


D52 




58 


72.7 


D52 




Table 6 


CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


33 


100.0 


D51 


13 


59 


90.9 


D52 


1 



20 
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CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


60 


83.3 


D52 


1 


61 


81.8 


D52 


4 


62 


81.8 


D52 


1 


63 


81.8 


D52 




64 


81.8 


D52 




65 


81.8 


D51 




66 


81.8 


D52 




67 


81.8 


D52 




68 


72.7 


D52 




69 


72.7 


D52 





Table 7 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


34 


100.0 


D51 


30 


34 


100.0 


M56 


2 


34 


100.0 


V54 


2 


70 


88.9 


D51 


1 


71 


88.9 


D51 


1 


72 


88.9 


V54 


2 


73 


77.9 


M55 


1 


74 


77.9 


D51 


1 


75 


77.9 


M55 


1 


76 


77.9 


M55 


1 



WO 2004/094474 



PCT/GB2004/001619 



-42- 

Patient D02 also showed a high degree of focusing of the VH repertoire, although focusing 
is less obvious in the VL repertoire. Two types predominate in the VH library of this 
patient. VH's with the CDR3 sequence of SEQ ID NO: 30 accounted for 45.6% of 
sequenced clones and those with the CDR3 sequence of SEQ ID NO: 31 accounted for 
5 8.8% of the library. In the VL library the three most frequently isolated clones with the 
CDR3 sequences of SEQ ID NOs: 61, 96, and 97 were found at 4%, 4% and 3%, 
respectively. 

Similarity searches in the FABTEC database showed the two common VH CDR3 
10 sequences to be specific to C. difficile patients (Tables 8 and 9). VH's with the CDR3 
sequence of SEQ ID NO: 30 match with other antibodies in that library and antibodies 
from library D03 (Table 8). VH's with the CDR3 sequence of SEQ ID NO: 3 1 match with 
antibodies from library D03 (Table 9). 

15 Table 8 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


77 


100.0 


D02 




78 


100.0 


D02 




30 


100.0 


D02 


52 


30 


100.0 


D03 




79 


100.0 


D02 




79 


100.0 


D03 




80 


94.4 


D02 




81 


94.4 


D02 




82 


94.4 


D02 




83 


88.8 


D02 




84 


77.7 


D03 
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Table 9 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


31 


100.0 


D02 


10 


31 


100.0 


D03 


4 


85 


90.0 


D03 


1 


86 


90.0 


D03 


1 



Similarity searches using some of the VL's showed a clustering of sequences. 15% of VL's 
10 from patient D02 (library D52) showed high degree of similarity to the CDR3 sequence 
of SEQ ID NO: 48, VL's from patient D01 (library D51) also showed a high degree of 
homology to this sequence (Table 10). In a similar way a cluster around the sequence of 
SEQ ID NO: 61 accounts for 13% of patient D02 sequenced VL's, again clones similar to 
this one are also found in patient D01 (Table 11). Neither of these clones show any 
15 significant similarity to VL CDR3 sequences isolated from patients infected with other 
pathogens. It should also be note that using the BLOSUM50 matrix these two clones show 
greater than 60% similarity to each other. 



Table 10 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


48 


100.0 


D52 


2 


49 


100.0 


D52 


1 


53 


100.0 


D52 


1 


32 


90.9 


D51 


64 


87 


81.8 


D52 


2 


51 


81.8 


D51 




88 


81.8 


D52 


1 
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Table 10 continued! 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


57 


72.7 


D52 


1 


89 


72.7 


D52 


1 


50 


72.7 


D52 


1 


54 


72.7 


D52 


1 


55 


72.7 


D52 


2 



Table 11 



CDR3 SEQ ID NO: 


Similarity 


Library 


No. of clones 


61 


100.0 


D52 


4 


65 


100.0 


D51 




59 


81.8 


D52 




33 


81.8 


D51 


13 


62 


81.8 


D52 




63 


81.8 


D52 




60 


75.0 


D52 




64 


72.7 


D52 




66 


72.7 


D52 




68 


72.7 


D52 




67 


72.7 


D52 





Patient D03 has only preliminary results for VH's but shows many similar sequences to 
both D01 and D02 so are worth reporting. Only data from 49 VH sequences have been 
25 produced. 10% of the clones have the CDR3 sequence of SEQ ID NO: 27, this is the 
sequence of the most common VH in patient D01. 8% of VH's have the CDR3 sequence 
of SEQ ID NO: 3 1 - this is the second most common VH CDR3 sequence in patient D02. 
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Cross-Library comparison (Table 12) of the CDR3 sequences of VH's has shown 8 CDR3 
sequences which appear in more than one of the C. difficile infected patients. The CDR3 
sequences of SEQ ID NOs: 27, 28, and 37 appear in patients D01 and D03 and the 
sequences of SEQ ID NOs: 31, 90, 91, 79, and 30 all appear in patients D02 and D03. 
5 So far patients D01 and D02 show no common CDR3 sequences. VL's have only been 
sequenced from patients D01 and D02, these patients share no identical CDR3 sequences. 



Table 12 



VH CDR3 SEQ ID NO: 


Patient 


27 


D01 andD03 


28 


D01 andD03 


37 


D01 and D03 


31 


D02 and D03 


90 


D02andD03 


91 


D02andD03 


79 


D02 and D03 


30 


D02 and D03 



Summary 

20 The presence of a large number of a particular VH CDR3 sequence in a patient with a 
C. difficile infection indicates that VH as part of an scFv may be protective against the 
organism. CDR3 sequences that are shared by more than one patient indicate specific 
C. difficile sequences. It should also be noted that sequences with a high degree of 
homology often due to somatic mutation within the patient may also represent 

25 important sequences with similar but subtly different properties. Analysis of the CDR3 
sequences of the VH from our individuals has lead to the identification of 17 
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potentially protective VH's. These VH CDR3 sequences correspond to SEQ ID NOs: 
30, 79, 28, 91, 41, 39, 38, 27, 36, 36, 40, 37, 90, 29, 44, 45, and 43. 

A similar analysis of the VL has identified 44 different VL CDR3 sequences. The larger 
5 number is due to the increased rate of somatic mutation in antibody light chains leading 
to VL's with slightly different sequences. These VL CDR3 sequences correspond to SEQ 
ID NOs: 59, 60, 61, 33, 62, 63, 64, 65, 66, 77, 84, 78, 30, 79, 80, 81, 82, 87, 57, 74, 85, 31, 
86, 82, 51, 48, 88, 49, 53, 32, 52, 70, 71, 34, 89, 68, 67, 50, 54, 55, 56, 59, 58, 96, and 97. 
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1 . Aii antibody or an antigen binding fragment thereof having the CDR-H3 sequence 
selected from the group consisting of: SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 
29, SEQ ID NO: 30, and SEQ ID NO: 31. 

2. An antibody or an antigen binding fragment thereof having the CDR-L3 sequence 
selected from the group consisting of: SEQ ID NO: 32, SEQ ID NO: 33, and SEQ ID NO: 
34. 

3. An antibody or an antigen binding fragment thereof having a CDR-H3 sequence 
selected from the group consisting of: SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29, 
SEQ ID NO: 30, and SEQ ID NO: 31, and a CDR-L3 sequence selected from the group 
consisting of: SEQ ID NO: 32, SEQ ID NO: 33, and SEQ ID NO: 34. 

4. A method for identifying candidate sequences of at least the CDR3 region of 
antibodies specific against at least one antigen produced by Clostridium difficile during an 
infection or against a vaccine, comprising the steps of: 

(i) with B cells isolated from at least one patient who has been infected by 
Clostridium difficile or administered said vaccine, sequencing at least the 
CDR3 region of the VH and/or VL coding regions of said B cells; and 

(ii) correlating said sequenced at least the CDR3 regions of the VH and/or VL 
coding regions of said B cells from said at least one patient to identify a set 
of candidate sequences for at least a CDR3 region of antibodies specific 
against said at least one antigen produced by Clostridium difficile or against 
said vaccine, each of said set of candidate CDR3 sequences or a sequence 
having at least 80% homology therewith occurring in total at a frequency of 
at least 1 percent in the set of sequences determined at step (i). 
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5. A method according to claim 4, said B cells being selected from the group 
consisting of peripheral B-cell lymphocytes and B cells from the spleen. 

6. A method according to claim 5, said peripheral B-cell lymphocytes being isolated 
5 from blood from said at least one patient. 

7. A method according to any of claims 4-6, said at least one antigen being an 
immunogen. 

10 8. A method according to any of claims 4-7, said at least one patient displaying a 
pronounced antibody response in response to infection by Clostridium difficile. 

9. A method according to any of claims 4-8, said at least one patient having recovered 
from infection by Clostridium difficile. 

15 

10. A method according to any of claims 4-9, said correlation step (ii) comprising 
determining putative amino acid sequences from said sequenced at least the VH and/or VL 
CDR3 coding regions, and correlating said putative amino acid sequences. 

20 11. A method according to claim 9, said correlation step (ii) comprising identifying the 
Complementarity Determining Regions comprised in said at least the VH and/or VL 
regions and correlating said Complementarity Determining Regions. 

12. A method according to claim 1 1 , said Complementarity Determining Regions being 
25 selected from the group consisting of CDR1 , CDR2 and CDR3 . 

13. A method according to any of claims 4-12, said correlation step (ii) additionally 
correlating at least one of the group consisting of: the strain of Clostridium difficile 
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infecting said at least one patient, the time point at which said B cells are isolated during 
infection of said at least one patient by Clostridium difficile, the age of said at least one 
patient, the sex of said at least one patient, and the race of said at least one patient. 

5 14. A method according to any of claims 4-13, said B cells having been isolated from 
said at least one patient at a plurality of time points during infection of said at least one 
patient by Clostridium difficile, said correlation step (ii) correlating the time point during 
infection of said at least one patient by Clostridium difficile at which said B cells are 
isolated. 

10 

15. A method according to any of claims 4-13, said B cells having been isolated from 
at least two patients, at least one of whom has recovered from infection by Clostridium 
difficile, and at least one of whom has not recovered from infection by Clostridium 
difficile, said correlation step (ii) correlating the recovery of said at least two patients from 

15 infection by Clostridium difficile. 

16. A method according to any of claims 4-13, said B cells having been isolated from 
at least two patients, said patients being infected by different strains of Clostridium difficile 
producing said at least one antigen, said correlation step (ii) correlating said sequenced at 

20 least the VH and/or VL coding regions of said B cells to identify a set of candidate 
sequences for antibodies, each of which is specific against at least one shared antigen 
produced by said different strains of Clostridium difficile or is specific against different 
antigens produced by said different strains of Clostridium difficile. 



25 



17. A method of manufacture of a medicament for the treatment of an infection by 
Clostridium difficile which produces at least one antigen, comprising the steps of: 
(i) performing a method according to any of claims 4-16; and 
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(ii) synthesising at least one antibody comprising at least one candidate sequence 
specific against said at least one antigen produced by Clostridium difficile. 

18. A method of manufacture of a medicament according to claim 1 7, comprising the 
step of combining said synthesised at least one antibody with a pharmaceutically 
acceptable carrier, diluent or excipient. 

19. A method of treatment of an infection of a patient by Clostridium difficile which 
produces at least one antigen, comprising the steps of: 

(i) performing a method according to any of claims 4-16; 

(ii) synthesising at least one antibody comprising at least one candidate 
sequence specific against said at least one antigen produced by Clostridium 
difficile-, and 

(iii) administering a therapeutically effective quantity of said at least one 
synthesised antibody to said patient. 

20. A method of producing a database which identifies candidate sequences for 
antibodies specific against at least one antigen produced by Clostridium difficile, 
comprising the steps of: 

(i) performing a method according to any of claims 4-16; and 

(ii) storing the data produced by said method in said database. 

21. A method of generating a report which identifies candidate sequences for antibodies 
specific against at least one antigen produced by Clostridium difficile, comprising the steps 
of: 

(i) performing a method according to any of claims 4-16; and 

(ii) producing a report comprising the data produced by said method. 
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22. A method for determining the efficacy of a vaccine, comprising the steps of: 

(i) with B cells isolated from at least one patient who has been administered 
said vaccine, sequencing at least the CDR3 region of the VH and/or VL 
coding regions of said B cells; and 

(ii) correlating said sequenced at least the CDR3 region of the VH and/or VL 
coding regions of said B cells to identify a set of candidate sequences for at 
least the CDR3 region of antibodies specific against said vaccine, each of 
said set of CDR3 candidate sequences or a sequence having at least 80% 
homology therewith occurring in total at a frequency of at least 1 percent in 
the set of sequences determined at step (i). 

23. A method according to claim 22, correlation step (ii) comprising correlating said 
sequenced at least a CDR3 region of the VH and/or VL coding regions of said B cells with 
sequenced at least a CDR3 region of the VH and/or VL coding regions of B cells isolated 
from at least one patient who has been infected with Clostridium difficile against which 
vaccination with said vaccine is intended to stimulate a protective immune response. 

24. A method according to either of claims 22 or 23, said correlation step (ii) 
additionally correlating at least one of the group consisting of: the time since 
administration of said vaccine to said at least one patient, the age of said at least one 
patient, the sex of said at least one patient, the race of said at least one patient, and the 
Complementarity Determining Regions of said sequenced at least the VH and/or VL 
coding regions. 

25. A method according to any of claims 22-24, said method determining the efficacy 
of said vaccine in stimulating a protective immune response against Clostridium difficile 
against which vaccination with said vaccine is intended to stimulate a protective immune 
response. 
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26. A method of producing a database which identifies the efficacy of a vaccine, 
comprising the steps of: 

(i) performing a method according to any one of claims 22-25; and 

(ii) storing the data produced by said method in said database. 

27. A method of generating a report which identifies the efficacy of a vaccine, 
comprising the steps of: 

(i) performing a method according to any one of claims 22-25; and 

(ii) producing a report comprising the data produced by said method. 

28. A diagnostic test method for identifying a Clostridium difficile infection in a patient, 
comprising the steps of: 

(i) with B cells isolated from said patient, sequencing at least the CDR3 region 
of the VH and/or VL coding regions of said B cells; 

(ii) comparing said sequenced at least said CDR3 region of the VH and/or VL 
coding regions of said B cells with a set of sequences for at least the CDR3 
region of antibodies specific against Clostridium difficile, and determining 
whether each of said set of CDR3 sequences or a sequence having at least 
80% homology therewith occurs in total at a frequency of at least 1 percent 
in the set of sequences determined at step (i); and 

(iii) correlating the results of comparison step (ii) to determine the presence or 
absence of a Clostridium difficile infection in said patient. 

29. A diagnostic test method for determining the susceptibility of a patient to 
Clostridium difficile infection, comprising the steps of: 

(i) with B cells isolated from said patient, sequencing at least the CDR3 region 
of the VH and/or VL coding regions of said B cells; 
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(ii) comparing said sequenced at least said CDR3 region of the VH and/or VL 
coding regions of said B cells with a set of sequences for at least the CDR3 
region of antibodies specific against Clostridium difficile, and determining 
whether each of said set of CDR3 sequences or a sequence having at least 
80% homology therewith occurs in total at a frequency of at least 1 percent 
in the set of sequences determined at step (i); and 

(iii) correlating the results of comparison step (ii) to determine the susceptibility 
of said patient to Clostridium difficile infection. 

30. A diagnostic test kit for performing a diagnostic test method according to any one 
of claims 28-29. 



WO 2004/094474 



PCT/GB2004/001619 



SEQUENCE LISTING 

<110> NeuTec Pharma PLC 

5 <120> Clostridium difficile Focussed Antibodies 

<130> GDM/JR-MP10 0462-WO 

<150> GB0309126.1 

10 <151> 2003-04-17 

<160> 97 

<170> Patentln version 3.2 

15 

<210> 1 

<211> 24 

<212> DNA 

<213> Artificial Sequence 

20 

<220> 

<223> PCR Primer 

<400> 1 

25 gtccaccttg gtgttgctgg gctt 



<210> 2 

<211> 24 

30 <212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> PCR Primer 

35 

<400> 2 

agactctccc ctgttgaagc tctt 
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<210> 3 

<211> 24 

<212> DNA 

5 <213> Artificial Sequence 

<220> 

<22 3> PGR Primer 

10 <400> 3 

tgaggagacg gtgaccaggg tgcc 



<210> 4 

15 <211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

20 <223> PCR Primer 



<400> 4 

tgaagagacg gtgaccattg tccc 



<210> 5 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



<400> 5 

35 tgaggagacg gtgaccaggg ttcc 24 
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<210> 6 

<211> 24 

<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> PGR Primer 

<400> 6 
10 tgaggagacg gtgaccgtgg tccc 



<210> 7 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR Primer 



20 



<400> 7 

caggtgcagc tggtgcagtc tgg 



25 <210> 
<211> 



<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> PCR Primer 



<400> 8 

caggtgaact taagggagtc tgg 



<210> 9 



WO 2004/094474 



PCT/GB2004/001619 



<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> PGR Primer 

<400> 9 

caggtgcagc tggtggagtc tgg 



<210> 10 

<211> 23 

<212> DNA 

15 <213> Artificial Sequence 

<220> 

<223> PCR Primer 

20 <400> 10 

caggtgcagc tgcaggagtc ggg 



<210> 11 

25 <211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

30 <223> PCR Primer 

<400> 11 

caggtgcagc tacagcagtg ggg 



35 



<210> 12 
<211> 23 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PGR Primer 



<400> 12 

gaggtgcagc tgttgcagtc tgc 



<210> 13 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR Primer 



<400> 13 
20 caggtacagc tgcagcagtc agg 



<210> 14 

<211> 24 

25 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> PCR Primer 

30 

<400> 14 

acgtttgatt tccaccttgg tccc 



35 <210> 15 
<211> 24 
<212> DNA 
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<213> Artificial Sequence 



<220> 

<223> PGR Primer 



<400> 15 

acgtttgatc tccagcttgg tccc 



10 <210> 16 

<211> 24 

<212> DNA 

<213> Artificial Sequence 

15 <220> 

<223> PCR Primer 

<400> 16 

acgtttgata tccactttgg tccc 



<210> 17 

<211> 24 

<212> DNA 

25 <213> Artificial Sequence 

<220> 

<2 23> PCR Primer 

30 <400> 17 

acgtttgatc tccaccttgg tccc 



<210> 18 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PCR Primer 



<400> 18 

acgtttattc tccagtcgtg tccc 



<210> 19 

<211> 23 

10 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> PCR Primer 



15 



<400> 19 

gacatccaga tgacccagtc tec 



20 <210> 
<211> 



<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> PCR Primer 



gatgttgtga tgactcagtc tec 

30 

<210> 21 
<211> 23 
<212> DNA 
35 <213> Artificial Sequence 



<220> 
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PGR Primer 



<400> 21 

gaaattgtgt tgacgcagtc tec 



<210> 22 

<211> 23 

<212> DNA 

10 <213> Artificial Sequence 

<220> 

<223> PCR Primer 

15 <400> 22 

gacatcgtga tgacccagtc tec 



<210> 23 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> PCR Primer 

<400> 23 

gaaactacac tcacgcagtc tec 



<210> 24 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> PCR primer 
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<400> 24 

gaaattgtgc tgactcagtc tec 



5 <210> 25 

<211> 17 

<212> DNA 

<213> Artificial Sequence 

10 <220> 

<223> PCR Primer 

<400> 25 

gtaaaacgac ggccagt 



<210> 26 

<211> 16 

<212> DNA 

20 <213> Artificial Sequence 

<220> 

<223> PCR Primer 

25 <400> 26 

aacagctatg accatg 



<210> 27 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 27 



Glu lie Arg Ala Pro Asp His His Asp Phe Ser Gly Tyr Leu Gly Arg 
15 10 15 
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<210> 28 

<211> 9 

<212> PRT 

5 <213> Homo sapiens 

<400> 28 



10 



Asp lie Ser Ala Gly Gly Leu Asp Val 



<210> 29 

<211> 16 

<212> PRT 

<213> Homo sapiens 



Asn Val Gly Ser Gly Ser Tyr Tyr Thr Gly Gly His Trp Phe Asp Pro 



<210> 30 

25 <211> 18 

<212> PRT 

<213> Homo sapiens 



<400> 30 

30 

Asp Gly Val Arg Gin Tyr Ser Gly Gly Arg Tyr Ser Asn His Gly Met 
15 10 15 



35 Asp Val 
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<210> 31 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 31 

Leu Thr Ala Ala Gly Gly His Phe Asp Pro 



<210> 32 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 32 

Asn Ser Arg Asp Ser Thr Gly Asn Gin Leu 



<210> 33 

<211> 11 

<212> PRT 

<213> Homo sapiens 



Ala Ala Trp Asp Asp Ser Leu Ser Glu Phe Leu 



<210> 34 

<211> 9 

<212> PRT 

<213> Homo sapiens 
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Gln Gin Tyr Lys Gly Tyr Pro Leu Thr 



<210> 35 

<211> 16 

<212> PRT 

<213> Homo sapiens 



Glu lie Arg Ala Pro Asp His His Asp Leu Ser Gly Tyr Leu Gly Arg 



<210> 36 

<211> 16 

<212> PRT 

<213> Homo sapiens 



Glu He Arg Ala Pro Asn His His Asp Phe Ser Gly Tyr Leu Gly Arg 



<210> 
30 <211> 



35 



<212> PRT 

<213> Homo sapie: 



Glu Val Arg Ala Pro Asp His His Asp Phe Ser Gly Tyr Leu Gly Arg 
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<210> 38 

<211> 16 

<212> PRT 

<213> Homo sapiens 



Glu lie Arg Ala Pro Asp His His Asp Phe Ser Gly Tyr Leu Gly Cys 



10 



<210> 39 

<211> 9 

15 <212> PRT 

<213> Homo sapiens 

<400> 39 

20 Asp Val Ser Ala Gly Gly Leu Asp Val 



<210> 
25 <211> 



30 



<212> PRT 

<213> Homo sapiens 



<400> 40 



Glu lie Ser Ala Gly Ala Leu Asp Val 



35 <210> 
<211> 
<212> 



41 
16 
PRT 
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15 



30 



35 



<213> Homo sapiens 



<400> 41 



5 Asp Val Gly Ser Gly Ser Tyr Tyr Thr Gly Gly His Trp Phe Asp Pro 



<210> 42 

<211> 16 

<212> PRT 

<213> Homo sapiens 



<400> 42 



Asn Val Gly Ser Gly Ser Tyr Tyr Thr Gly Gly His Trp Phe Glu Pro 
15 10 15 



<210> 43 

<211> 16 

<212> PRT 

<213> Homo sapiens 



25 <400> 43 



Asn Val Gly Ser Gly Ser Tyr Tyr Thr Gly Gly His Trp Leu Asp Pro 
15 10 15 



<210> 44 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 44 
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Asn Val Gly Ser Gly Ser Tyr Tyr Thr Gly Gly His Trp Phe Asp Thr 



<210> 45 

<211> 16 

<212> PRT 

<213> Homo sapiens 



Asn Val Gly Ser Gly Ser Tyr Tyr Thr Gly Gly His Trp Phe Gly Pro 



<210> 46 

<211> 15 

<212> PRT 

<213> Homo sapiens 

<400> 46 

Asn Val Gly Ser Gly Ser Tyr Tyr Thr Ala Thr Cys Phe Asp Pro 



<210> 47 

<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 4 7 



Gly Ala Lys His Phe Glu Gly Ser Gly Ser Trp Phe Ser Trp Phe Asp 
35 l 5 10 15 
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Pro 



<210> 48 

<211> 11 

<212> PRT 

<213> Homo sapiens 



Asn Ser Arg Asp Asn Thr Gly His His Val Val 



<210> 49 

<211> 11 

<212> PRT 

<213> Homo sapiens 



<400> 49 



Asn Ser Arg Asp Ser Ser Gly Asn His Val Val 



<210> 50 

<211> 11 

<212> PRT 

<213> Homo sapiens 



Ser Ser Arg Asp Asn Ser Gly Asp Arg Tyr Val 
35 1 5 10 
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<210> 51 

<211> 10 

<212> PRT 

<213> Homo sapiens 



Asn Ser Arg Asp Gly Thr Gly Asn Gin Leu 



<210> 52 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 52 

Asn Ser Arg Asp Thr Asn Gly Asp Gin Leu 



<210> 53 

<211> 11 

<212> PRT 

<213> Homo sapiens 



Asn Ser Arg Asp Ser Ser Gly Tyr His Val lie 



<210> 54 

35 <211> 11 

<212> PRT 

<213> Homo sapiens 
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Ser Ser Arg Asp Ser Lys Gly His Arg Tyr Val 



<210> 55 

<211> 11 

<212> PRT 

10 <213> Homo sapiens 



Ser Ser Arg Asp Ser Asn Gly Asn Arg Tyr Val 



<210> 56 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 56 

Ser Ser Arg Asp Thr Lys Gly His Arg Tyr Val 



<210> 57 

30 <211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 57 

35 

Lys Pro Arg Asp Ser Ser Gly Asn His Val 
15 10 
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<210> 58 

<211> 11 

<212> PRT 

<213> Homo sapiens 



Thr Ser Arg Asp Ser Asn Gly Asn Arg Tyr Val 



<210> 59 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 59 

Ala Ala Trp Asp Asp Asn Leu Ser Ala Tyr Val 



30 



<210> 60 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 60 

Ala Ala Trp Asp Asp Ser Leu Gly Gly Lys Tyr Val 
15 10 



35 <210> 61 
<211> 11 
<212> PRT 
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<213> Homo sapiens 
<400> 61 

5 Ala Ala Trp Asp Asp Ser Leu Asn Gly Val Val 
15 10 



15 



<210> 62 

<211> 11 

<212> PRT 

<213> Homo sapiens 



Ala Ala Trp Asp Asp Ser Val Lys Gly Trp Val 



<211> 11 

<212> PRT 

<213> Homo sapiens 

25 <400> 63 

Ala Ala Trp Asp Asp Ser Val Arg Gly Trp Val 
15 10 

30 

<210> 64 

<211> 11 

<212> PRT 

<213> Homo sapiens 

35 

<400> 64 
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Ala Ala Trp Asp Asp Ser Val Arg Ser Trp Val 



<210> 65 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 65 

Ala Ser Trp Asp Asp Ser Leu Asn Gly Val Val 



<210> 66 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 66 

Ala Ser Trp Asp Asp Ser Gin Ala Ala Leu Val 



<210> 67 

<211> 11 

<212> PRT ' 

<213> Homo sapiens 

<400> 67 



Ser Ala Trp Asp Ser Ser Leu Ser Thr Trp Val 
35 1 5 10 
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<210> 68 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 68 

Gin Ser Tyr Asp Ser Ser Leu Ser Gly Trp Val 



<210> 69 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 69 

Ser Ser Tyr Thr Asn Thr Asn Pro Tyr Val 



<210> 70 

<211> 9 

25 <212> PRT 

<213> Homo sapiens 

<400> 70 

30 Gin Gin Phe Lys Ser Phe Pro Leu Thr 



<210> 71 

<211> 9 

<212> PRT 

<213> Homo sapiens 
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<400> 71 



Gin Gin Phe Gin Ser Tyr Pro Val Thr 



<210> 72 

<211> 9 

<212> PRT 

<213> Homo sapiens 



Gin Gin Tyr Lys Ser Tyr Pro Leu Thr 



<210> 73 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 73 

His Gin Tyr Tyr Arg Phe Pro Leu Thr 



<210> 74 

30 <211> 9 

<212> PRT 

<213> Homo sapiens 



Leu Gin Tyr Lys Lys Trp Pro Leu Thr 
1 5 
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<210> 75 

<211> 9 

<212> PRT 

<213> Homo sapiens 



Gin His Phe Ser His Tyr Pro Leu Thr 



10 



<210> 76 

<211> 9 

<212> PRT 

<213> Homo sapiens 



Gin His Tyr Tyr Ser Tyr Pro Leu Thr 



30 



<210> 77 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 77 

Asp Gly Val Arg Gin Tyr Asn Gly Gly Arg Tyr Ser Asn His Gly Met 
15 10 15 



35 



Asp Val 
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<210> 78 

<211> 18 

<212> PRT 

<213> Homo sapiens 

5 

<400> 78 

Asp Gly Val Arg Gin Tyr Ser Gly Gly Lys Tyr Ser Asn His Gly Met 
15 10 15 

10 

Asp Val 



<210> 79 

<211> 18 

<212> PRT 

<213> Homo sapiens 



Asp Gly Val Arg Gin Tyr Ser Gly Gly Arg Tyr Ser Asn His Gly Met 



30 

<210> 



<211> 18 

<212> PRT 

<213> Homo sapiens 



WO 2004/094474 



PCT/GB2004/001619 



Asp Gly Val Ser Gin His Asn Gly Gly Arg Tyr Ser Asn His Gly Met 



5 Asp Val 



<210> 81 

10 <211> 18 

<212> PRT 

<213> Homo sapiens 



15 



30 



<400> 81 



Asp Gly Val Ser Gin Tyr Ser Gly Gly Arg Tyr Ser Asn His Gly Met 
15 10 15 



20 Asp Val 



<210> 82 

25 <211> 18 

<212> PRT 

<213> Homo sapiens 



<400> 82 



Asn Gly Val Arg Gin Tyr Ser Gly Gly Arg Tyr Ser Asn His Arg Met 
15 10 15 



35 Asp Val 



WO 2004/094474 



PCT/GB2004/001619 



5 



10 



15 



30 



35 



<210> 83 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 83 

Glu Glu Glu Arg Gin Tyr Ser Gly Gly Arg Tyr Ser Asn His Gly Met 



Asp Val 



<210> 84 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 84 

Asp Gly Val Arg Gin Tyr Ser Gly Ala Asp Thr Pro Asn His Gly Met 



<210> 85 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 85 
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Leu Thr Ala Ala Gly Ala His Phe Asp Pro 



<210> 86 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 86 

Leu Thr Ala Ala Gly Gly Arg Phe Asp Pro 



<210> 87 

<211> 11 

<212> PRT 

<213> Homo sapiens 

<400> 87 

His Ser Arg Asp Asp Thr His Tyr Pro Val lie 



<210> 88 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 88 



Asn Ser Arg Asp Ser Asn Asn His Val Val 
35 1 5 10 
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<210> 89 

<211> 9 

<212> PRT 

<213> Homo sapiens 



Gin Ser Arg Asn Asn Thr Tyr Val Leu 



<210> 90 

<211> 11 

<212> PRT 

<213> Homo sapiens 



His Thr Arg Asp Thr Leu Phe Pro Val Asp Phe 



<210> 91 

<211> 19 

<212> PRT 

<213> Homo sapiens 



Asp Met Gly Met Phe Cys Ser Gly Val He Cys Tyr Asp Tyr Tyr Gly 



Met Asp Val 

35 
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<210> 92 

<211> 4 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Used for defining CDR region 



<220> 

<221> MIS C_FEATURE 

<222> (3) . . (3) 

<223> Any amino acid 

<400> 92 

Phe Gly Xaa Gly 



<210> 93 

<211> 4 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Used for defining CDR region 



<220> 

<221> MI SC_FEATURE 

<222> (2) . . (4) 

<22 3> Any amino acid 

<400> 93 

Cys Xaa Xaa Xaa 
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<210> 94 

<211> 5 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Used for defining CDR region 

<400> 94 

Leu Glu Trp lie Gly 



<210> 95 

<211> 4 

20 <212> PRT 

<213> Artificial Sequence 

<220> 

<223> Used for defining CDR region 



<220> 

<221> MISC_FEATURE 

<222> (3) . . (3) 

<223> Any amino acid 



<400> 95 



Trp Gly Xaa Gly 
35 1 
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<210> 96 

<211> 10 

<212> PRT 

<213 > Homo sapiens 

<400> 96 

Gin Ser Tyr Asp Ser Ser Arg Gly Arg Val 



<210> 97 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 97 



Met Gin Gly lie Arg Pro Pro Arg Thr 
20 1 5 



INTERNATIONAL SEARCH REPORT 



In rnational Application No 

T/GB2004/001619 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Documentation searched other than minimum documentation to the extent th.?i si 



h documents are included in the fi£ 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

EPO-Internal , BIOSIS, EMBL, EMBASE, MEDLINE, WPI Data 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Citation of document, with indication, where appropriate, of the re 



DE 197 39 685 A (EICHEL STREIBER CHRISTOPH 
VON) 11 March 1999 (1999-03-11) 
page 3, line 1 - line 11 
examples 1-6 



WO 00/12562 A (GENENTECH INC) 

9 March 2000 (2000-03-09) 

figure 2 

claim 8 

SEQ ID No. 15 



-/-- 



LU 1 



re listed in the continuation of box C. 



nt family members are listed in 



° Special categories of cited documents : 

document defining the general state of the art which is not 
considered to be of particular relevance 
earlier document but published on or after the international 



document which r 
which is cited to 



Ihrow doubts on priority claim(s) or 
iblish the publication date of another 
;ial reason (as specified) 
referring to an oral disclosure, use, exhibition 01 



"T" later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
menls, such combination being obvious to a person skilled 
in the art. 

■&" document member of the same patent family 



ie actual completion of the international se 



2 September 2004 



Date of mailing of the international se 



27/09/2004 



..„,__jn Patent Office, P.B. e 
NL - 2280 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 3 
Fax: (+31-70) 340-3016 



Authorized officer 



Ulbrecht, M 



(second sheet) (January 2004) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 


International Application No 

T/GB2004/001619 


C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category ° 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


X 


WO 97/20932 A (ALLEN DEBORAH JULIE ; 
CAMBRIDGE ANTIBODY TECH (GB); MCCAFFERTY 
JOHN GE) 12 June 1997 (1997-06-12) 

exampl e 1 
table 1 
figure lb 
claim 14 




1-3,30 


X 


WHITE HARRY ET AL: "Analysis of 

immunoglobulin (Ig) isotype diversity and 

IgM/D memory in the response to 

phenyl -oxazol one" 

JOURNAL OF EXPERIMENTAL MEDICINE, 

vol. 191, no. 12, 

19 June 2000 (2000-06-19), pages 

2209-2219, XP002293496 

ISSN: 0022-1007 

page 2210, left-hand column, paragraph 3 - 
page 2211, left-hand column, paragraph 1 




30 


X 


NIE XIA0B0 ET AL: "Immunization with 
immune complex alters the repertoire of 
antigen-reactive B cells in the germinal 
centers" 

EUROPEAN JOURNAL OF IMMUNOLOGY, 

vol. 27, no. 12, December 1997 (1997-12), 

pages 3517-3525, XP009035552 

ISSN: 0014-2980 

page 3524, left-hand column, paragraph 3 - 
right-hand column, paragraph 1 




30 


p,x 


W0 03/052416 A (MATTHEWS RUTH CHRISTINE ; 

NEUTEC PHARMA PLC (GB); RIGG GORDON 

PATRICK) 26 June 2003 (2003-06-26) 

the whole document 

in particular SEQ ID Nos. 37 and 50. 




1,3-27, 
30 


P,X 


HOLT L J ET AL: "Domain antibodies: 

proteins for therapy" 

TRENDS IN BIOTECHNOLOGY, ELSEVIER 

PUBLICATIONS, CAMBRIDGE, GB, 

vol. 21, no. 11, November 2003 (2003-11), 

pages 484-490, XP004467495 

ISSN: 0167-7799 

the whole document 




1-3, 
17-19 



Form PCT/ISA/210 (continuation of second sheet) (January 2004) 



page 2 of 2 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/GB2004/001619 



Box No. I Nucleotide and/or amino acid sequence(s) (Continuation of item 1 .b of the first sheet) 



a. type of material 

| x | a sequence listing 

f^J table(s) related to the sequence listing 

b. format of material 

| X | in written format 

| x 1 in computer readable form 

c. time of filing/furnishing 

x | contained in the international application as filed 

| filed together with the International application In computer readable form 
| furnished subsequently to this Authority for the purpose of search 

I In addition, in the case that more than one version or copy of a sequence listing and/or table relating thereto has been filed 

or furnished, the required statements that the information in the subsequent or additional copies is identical to that in the 
application as filed or does not go beyond the application as filed, as appropriate, were furnished. 

Additional comments: 



Form PCT/ISA/210 (continuation of first sheet (1)) (January 2004) 



INTERNATIONAL SEARCH REPORT 



international application No. 

PCT/GB2004/001619 



Box II Observations where certain claims were found unsearchable (Continuation of item 2 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. Claims Nos.: 19 

because they relate to subject matter not required to be searched by this Authority, namely: 

Although claim 19 is directed to a method of treatment of the human/animal 
body, the search has been carried out and based on the alleged effects of the 
compound/composition. 

2. Q Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



3. I I Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box III Observations where unity of invention is lacking (Continuation of item 3 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1 . I I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
1 — 1 searchable claims. 

2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
— of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
' — ' covers only those claims for which fees were paid, specifically claims Nos.: 



4. I I No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 

restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

I I No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet (2)) (January 2004) 



INTERNATIONAL SEARCH REPORT 

Information on patent family members 



rnational Application No 

T/GB2004/001619 



DE 
AT 
AU 
BR 
CA 
CN 
DE 
WO 
EP 
ES 
JP 
US 

us 



19739685 Al 

254139 T 

9742698 A 

9815367 A 

2303202 Al 

1273588 T 

59810172 Dl 

9912971 A2 

0994904 A2 

2210832 T3 

2001515920 T 

6667035 Bl 

2004137601 Al 



11-03-1999 
15-11-2003 
29-03-1999 
06-11-2001 
18-03-1999 
15-11-2000 
18-12-2003 
18-03-1999 
26-04-2000 
01-07-2004 
25-09-2001 
23-12-2003 
15-07-2004 



AT 


217889 


T 


15-06-2002 


AU 


766551 


B2 


16-10-2003 


AU 


5692399 


A 


21-03-2000 


BR 


9913435 


A 


25-09-2001 


CA 


2341276 


Al 


09-03-2000 


CN 


1317015 


T 


10-10-2001 


DE 


69901569 


Dl 


27-06-2002 


DE 


69901569 


T2 


19-12-2002 


DK 


1107996 


T3 


16-09-2002 


EP 


1107996 


Al 


20-06-2001 


ES 


2177316 


T3 


01-12-2002 


JP 


2002523081 


T 


30-07-2002 


NZ 


509430 


A 


26-04-2002 


PT 


1107996 


T 


31-10-2002 


WO 


0012562 


Al 


09-03-2000 


US 


6624295 


Bl 


23-09-2003 


ZA 


200100681 


A 


24-01-2002 



AT 


200516 


T 


15-04-2001 


AU 


703319 


B2 


25-03-1999 


AU 


1103697 


A 


27-06-1997 


CA 


2239519 


Al 


12-06-1997 


DE 


69612509 


Dl 


17-05-2001 


DE 


69612509 


T2 


18-10-2001 


DK 


865492 


T3 


11-06-2001 


EP 


0865492 


Al 


23-09-1998 


ES 


2157473 


T3 


16-08-2001 


WO 


9720932 


Al 


12-06-1997 


JP 


2000504204 


T 


11-04-2000 


US 


5872215 


A 


16-02-1999 



WO 03052416 A 26-06-2003 CA 2471570 Al 26-06-2003 

EP 1415002 A2 06-05-2004 

WO 03052416 A2 26-06-2003 



Form PCT/ISA/210 (patent family annex) (January 2004) 



